Field of the invention

This invention relates to the improvement of poultry production through the use of
recombinant neuropeptides. The invention is premised on the discovery of gene sequences from
chicken encoding the neuropeptides GRF (Growth Hormone-Releasing Hormone) and PACAP
(Pituitary Adenylate Cyclase-Activating Polypeptide).

Background of the invention

Poultry is a major source of protein in the western diet. However, many years of breeding
selection for fast-growing chickens and turkeys have resulted in commercial poultry strains
which have increased fat deposits. Because of this increased fat content,
some nutritionists no longer recommend poultry over trimmed red meat. In addition, four times as
much feed is required to produce 1 gram of fat compared to 1 gram of muscle, and so this increased
fat content also elevates production costs (notably, the cost of feed represents over half of the
expense of raising poultry: 69% for broiler chickens, 61% for turkeys). Accordingly, the ability to
produce poultry with a lower fat content would have both health and economic benefits.

Growth Hormone-Releasing Hormone (GHRH or GRF) and Pituitary Adenylate Cyclase-Activating
Polypeptide (PACAP) are two members of the glucagon superfamily of proteins. They
are neuropeptides which, amongst other activities, stimulate the release of pituitary growth hormone 
(GH), the major growth hormone in animals. In human studies, recombinant GH has been shown to 
increase lean body mass and reduce fat content in elderly adults. By extrapolation, the regulation of 
GH in agricultural animals may be useful to control growth rates and body composition. 
Accordingly, there is much interest in GRF and PACAP, and a major focus of the ongoing research 
is the search for genes which encode these neuropeptides in agriculturally important animal species, 
including poultry. 

Although rat and human GRF have been shown to stimulate GH release from chicken 
pituitary cells in vitro, a chicken GRF has not yet been reported. It is a goal of the present invention 
to provide neuropeptide gene sequences and peptide sequences which function to stimulate GH 
release in poultry. 

Summary of the invention 

This invention provides, for the first time, a gene from chicken which encodes a precursor
polypeptide for both GRF and PACAP (this gene is referred to herein as the chicken GRF/PACAP
gene). The chicken GRF/PACAP gene sequence is provided, along with 5' and 3' regulatory
sequences which regulate expression of the gene. Also provided are the amino acid sequences of the
peptides encoded by this gene (because of alternative splicing of the introns in the gene, three
different mRNAs, encoding three slightly different peptides, are produced). This invention permits
the generation of nucleotide sequences encoding either the GRF/PACAP precursor polypeptide or the
individual GRF or PACAP peptides, or both of these individual peptides.

Other aspects of the invention involve recombinant cloning vectors including nucleotide
sequences encoding the chicken GRF and/or PACAP peptides, and transgenic host cells transformed
with such cloning vectors. Genetic constructs designed for optimal expression of the chicken
PACAP and/or GRF peptides are also provided. These constructs include nucleotides encoding the
chicken GRF/PACAP precursor peptide or the individual chicken PACAP or GRF peptides in
association with regulatory sequences which control the expression of the coding sequences. For
example, a cDNA molecule encoding the chicken GRF may be functionally linked to the 5' promoter
region found upstream of the chicken GRF/PACAP gene.

Also encompassed by this invention are nucleotide sequences which include less than the
entire chicken GRF/PACAP gene. For example, oligonucleotide primer and probe sequences which
are derived from the nucleotide sequences provided are included within the scope of the invention.
Such sequences, which are typically 10-50 nucleotides in length, are useful, amongst other things, for
amplifying the chicken gene from various tissues, performing hybridization studies and cloning
corresponding gene sequences from other species. Nucleotide sequences which encode larger
sub-parts of the chicken GRF/PACAP gene are also part of the invention. Such sequences include, for
example, DNA molecules encoding the individual GRF and PACAP peptides, as well as regulatory
regions useful in controlling gene expression. These sequences preferably include at least 10
contiguous nucleotides of the disclosed GRF/PACAP gene sequence (and more preferably at least
25, 30 or at least 50 contiguous nucleotides). In other embodiments, these nucleotide sequences
encode a peptide capable of stimulating the release of pituitary growth hormone from chicken
pituitary cells.

The provision of the chicken GRF/PACAP gene sequence also enables the cloning of related 
genes from other species, and the production of variants on the disclosed gene sequence. These
variant sequences are defined as sequences which hybridize under conditions of at least 75% 
stringency to the disclosed sequences and which retain the characteristic of encoding a polypeptide 
capable of stimulating the release of pituitary growth hormone from chicken pituitary cells. 

Another aspect of the invention is the purified chicken GRF and PACAP peptides. The
peptides may be purified from cell extracts, for example from host cells transformed with a
recombinant vector expressing the peptides, or they may be synthesized by standard peptide synthesis
methods. Purified GRF and PACAP peptides may be administered to animals directly to modulate
GH levels and thereby regulate body composition and growth rates. For example, the purified
peptides may be administered orally to chicks in feed, or may be formulated into slow-release pellets
which are administered subcutaneously. Such slow-release pellets comprise the peptide combined
with a biocompatible matrix, such as cholesterol. Other methods of administration include injection
of the peptides incorporated into a biocompatible matrix, and the use of mini osmotic pumps. The
amino acid sequences of the disclosed GRF, PACAP and GRF/PACAP precursor polypeptides may
also be modified in exact sequence, while retaining the characteristic function of stimulating the
release of pituitary growth hormone from chicken pituitary cells. Such variant amino acid sequences
preferably include a stretch of at least 20 consecutive amino acids identical to the amino acid
sequence of the disclosed peptides.

The nucleotide sequences disclosed herein may also be used to enhance the growth rate or 
improve the body composition of farmed animals. For example, genetic constructs including the 
chicken GRF/PACAP gene may be introduced into chicken primordial germ cells to produce 
genetically altered chickens. Successful integration of such constructs into the chicken genome will 
produce a bird carrying additional copies of the GRF/PACAP gene which, in turn, would be 
expected to produce higher levels of the GRF/PACAP polypeptide and thereby elevated levels of 
pituitary GH. Even higher levels of GRF/PACAP expression may be obtained by using 
GRF/PACAP constructs in which the open reading frame is operably linked to a promoter known to 
direct high level expression of downstream gene sequences. Promoter sequences specific for 
particular tissues (e.g. brain or gonads) or particular developmental stages may also be employed. 
The present invention also facilitates the ready detection of transgenic birds carrying introduced 
GRF/PACAP constructs. 

Brief description of the drawings

Fig. 1 shows the subclone organization of the chicken GRF/PACAP gene and the cDNA encoding 
the GRF/PACAP precursor polypeptide, together with the location of PCR primers. 
Fig. 2 shows the nucleotide sequence of the chicken GRF/PACAP gene. Nucleotides comprising 
subclones 1.8, 3.1, and 3.2 are shown along with the intron/exon boundaries and the 5'- and 3'-
flanking regions. The translated amino acid sequence is shown in the single letter code below the
nucleotide sequence of coded exons and both sequences are numbered on the right. The nucleotide
numbering begins at the beginning of the clone, whereas the amino acid numbering begins at the
initiating methionine. GRF is underlined with a solid line and PACAP is underlined with a dotted
line. All exons are in bold capital letters with the first exon being composed completely of 5'UTR.
GRF1-46 is encoded on two exons. The intervening intron has alternate splice sites. The intron-exon
boundary for nucleotides encoding the second part of GRF1-46 is shown (+). The other splice site,
9bp toward the 3' end, is shown by the double symbol (++). This splice site removes nine
nucleotides resulting in a shortened GRF1-43. Only a portion of the nucleotides that encode the
promoter region and introns 2 and 4 are shown. The remaining nucleotides are provided in Seq. I.D.
No. 1. Within the promoter, the CAAT and TATAAA sequence motifs have been underlined.

Sequence listing 

The accompanying sequence listing comprises the following sequence information: 
Seq. I.D. No. 1: complete nucleotide sequence of the chicken GRF/PACAP gene. 
Seq. I.D. No. 2: full length cDNA encoding chicken GRF/PACAP neuropeptide precursor. 
Seq. I.D. No. 3: amino acid sequence of the peptide encoded by full length cDNA shown in
Seq. I.D. No. 2 (including 46 amino acid GRF peptide and 38 amino acid PACAP peptide).
Seq. I.D. No. 4: amino acid sequence of the 46 amino acid GRF peptide encoded by full
length cDNA shown in Seq. I.D. No. 2.
Seq. I.D. No. 5: amino acid sequence of the 38 amino acid PACAP peptide.
Seq. I.D. No. 6: alternatively spliced cDNA sequence encoding chicken GRF/PACAP
neuropeptide precursor (alternatively spliced cDNA #1).
Seq. I.D. No. 7: amino acid sequence of the peptide encoded by alternatively spliced cDNA
#1 (including 43 amino acid GRF peptide and 38 amino acid PACAP peptide).
Seq. I.D. No. 8: amino acid sequence of the 43 amino acid GRF peptide encoded by
alternatively spliced cDNA #1.
Seq. I.D. No. 9: alternatively spliced cDNA sequence encoding chicken GRF/PACAP
neuropeptide precursor (alternatively spliced cDNA #2).
Seq. I.D. No. 10: amino acid sequence of the peptide encoded by alternatively spliced cDNA
#2 (including 14 amino acid truncated GRF peptide and 38 amino acid PACAP peptide).
Seq. I.D. No. 11: nucleotide sequence encoding chicken 43 amino acid GRF peptide.
Seq. I.D. No. 12: nucleotide sequence encoding chicken 46 amino acid GRF peptide.
Seq. I.D. No. 13: nucleotide sequence encoding chicken 38 amino acid PACAP peptide.
Seq. I.D. No. 14: nucleotide sequence of primer D used in PCR amplification.
Seq. I.D. No. 15: nucleotide sequence of primer F used in PCR amplification.
Seq. I.D. No. 16: nucleotide sequence of primer A used in PCR amplification.
Seq. I.D. No. 17: nucleotide sequence of primer 1 used in PCR amplification.
Seq. I.D. No. 18: nucleotide sequence of primer 2 used in PCR amplification.
Seq. I.D. No. 19: amino acid sequence of PACAP 27 (PACAP 27 is a form of PACAP
which results from alternative post-translational processing).
Seq. I.D. No. 20: amino acid sequence of GRF 29 (the first 29 amino acids of GRF; this is
believed to represent the minimally active unit of GRF).

Detailed description of the invention 

I. Definitions

Particular terms and phrases used have the meanings set forth below. 
Isolated: An "isolated" nucleic acid has been substantially separated or purified away from 
other nucleic acid sequences in the cell of the organism in which the nucleic acid naturally occurs, 
i.e., other chromosomal and extrachromosomal DNA and RNA. The term "isolated" thus 
encompasses nucleic acids purified by standard nucleic acid purification methods. The term also
embraces nucleic acids prepared by recombinant expression in a host cell as well as chemically 
synthesized nucleic acids. 

cDNA (complementary DNA): a piece of DNA lacking internal, non-coding segments
(introns) and regulatory sequences which determine transcription. cDNA is synthesized in the 
laboratory by reverse transcription from messenger RNA extracted from cells. 

ORF (open reading frame): a series of nucleotide triplets (codons) coding for amino acids 
without any termination codons. These sequences are usually translatable into a peptide. 
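
As an illustration of this definition, the following minimal Python sketch reads a DNA string codon by codon until the first termination codon is reached. The function name translate_orf and the 12-nucleotide example sequence are hypothetical and are not taken from the sequence listing; the codon table is the standard genetic code. The sketch assumes the input contains only A, C, G and T.

```python
# Minimal sketch: translate an open reading frame with the standard genetic code.
# The function name and the example sequence are illustrative assumptions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid table in TCAG order ('*' marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_orf(dna: str) -> str:
    """Translate codons from position 0 until a stop codon or the end of the sequence."""
    dna = dna.upper()
    peptide = []
    # Ignore any trailing one or two nucleotides that do not form a full codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        peptide.append(residue)
    return "".join(peptide)


# Hypothetical example: Met, then a Lys-Arg dibasic pair, then a stop codon.
example = "ATGAAACGTTAA"
print(translate_orf(example))
```

The same loop applied to a real coding sequence would yield the single letter amino acid code used in Fig. 2.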

Probes and primers: Nucleic acid probes and primers may readily be prepared based on the 
nucleic acids provided by this invention. A probe comprises an isolated nucleic acid attached to a 
detectable label or reporter molecule. Typical labels include radioactive isotopes, ligands, 
chemiluminescent agents, and enzymes. Methods for labeling and guidance in the choice of labels 
appropriate for various purposes are discussed, e.g., in Sambrook et al. (1989) and Ausubel et al. 
(1987). 

Primers are short nucleic acids, preferably DNA oligonucleotides 15 nucleotides or more in
length. Primers may be annealed to a complementary target DNA strand by nucleic acid
hybridization to form a hybrid between the primer and the target DNA strand, and then extended
along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for
amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other
nucleic-acid amplification methods known in the art.

Methods for preparing and using probes and primers are described, for example, in Sambrook 
et al. (1989), Ausubel et al. (1987), and Innis et al., (1990). PCR primer pairs can be derived from a 
known sequence, for example, by using computer programs intended for that purpose such as Primer 
(Version 0.5, © 1991, Whitehead Institute for Biomedical Research, Cambridge, MA). 

Purified: the term purified does not require absolute purity; rather, it is intended as a 
relative term. Thus, for example, a purified peptide or protein preparation is one in which the 
protein referred to is more pure than the protein in its natural environment within a cell. 

Operably linked: A first nucleic acid sequence is operably linked with a second nucleic acid
sequence when the first nucleic acid sequence is placed in a functional relationship with the second
nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the promoter
affects the transcription or expression of the coding sequence. Generally, operably linked DNA
sequences are contiguous and, where necessary to join two protein coding regions, in the same
reading frame.

Recombinant: A recombinant nucleic acid is one that has a sequence that is not naturally 
occurring or has a sequence that is made by an artificial combination of two otherwise separated 
segments of sequence. This artificial combination is often accomplished by chemical synthesis or, 
more commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic 
engineering techniques.

GRF: Growth hormone-releasing hormone (alternatively referred to as GHRH). 
PACAP: Pituitary adenylate cyclase-activating polypeptide. 
GRF/PACAP precursor polypeptide: a polypeptide which includes both GRF and PACAP
polypeptide sequences. Cleavage of this precursor polypeptide yields the individual GRF and 
PACAP polypeptides and a cryptic polypeptide. 

Additional definitions of common terms in molecular biology may be found in Lewin, B. 
"Genes IV" published by Oxford University Press. 

II. Cloning of chicken GRF/PACAP gene

A. Materials and methods 
1. DNA amplification 

i. Amplification of the chicken GRF/PACAP mRNA 3' end 



Chicken (Gallus domesticus) brains were removed, placed immediately in liquid nitrogen and
stored at -80°C. Total RNA was extracted with an acidic guanidinium thiocyanate method
(Chomczynski and Sacchi, 1987), followed by purification of poly A+ rich mRNA on two
consecutive oligo dT12-18 columns. Single stranded cDNA was synthesized with 10 µg poly A+ rich
mRNA, 2mM oligo dT* (primer E), 5 times Superscript buffer, 2mM dNTP, 10mM DTT, 5U RNA
guard (Pharmacia), and 200 U RT Superscript (BRL) to a final volume of 25 µl. The reaction was
heated to 42°C for 1.25hr and terminated by increasing the temperature to 95°C for 10 min.

Amplification was performed in a 50 µl volume with 0.2 µg cDNA, 5U Taq, 1x Taq buffer
(Promega), 200mM dNTPs, 2.5mM MgCl2, and 20 pmol of primers D
(5'-catgtttggacagaacacaacgtgagcg) (Seq. I.D. No. 14) and F (5'-cattcggatgggatcttcacggatag) (Seq. I.D.
No. 15). The reaction was carried out for 35 cycles at 94°C for 1 min, 45°C for 1.5 min, 72°C for
1.5 min and for a 5.3 min extension at 72°C. Amplified bands were cloned into pBluescript KS+
(Stratagene), electroporated into XL-1 competent cells, and prepared for sequencing with an alkaline
hydrolysis method (Birnboim 1983). Both strands were sequenced with [α-35S]dATP using the USB
Sequenase chain termination method (Sanger et al., 1977) and CircumVent thermal cycle sequencing
kit (New England Biolabs). All sequencing gels were 6% polyacrylamide/7M urea wedge gels,
dried under vacuum at 80°C and exposed to Kodak XAR-5 film for 12-24h.
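
For a quick sanity check of a primer pair such as D and F, the length, G/C fraction and reverse complement of each primer can be computed as sketched below. The primer sequences are those quoted above (Seq. I.D. Nos. 14 and 15); the function names gc_fraction and reverse_complement are hypothetical helpers introduced only for this illustration and are not part of the described method.

```python
# Sketch: basic properties of the two PCR primers quoted above.
PRIMERS = {
    "D": "catgtttggacagaacacaacgtgagcg",  # Seq. I.D. No. 14
    "F": "cattcggatgggatcttcacggatag",    # Seq. I.D. No. 15
}

# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("acgt", "tgca")


def gc_fraction(seq: str) -> float:
    """Fraction of positions that are G or C."""
    seq = seq.lower()
    return (seq.count("g") + seq.count("c")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Sequence of the opposite strand, written 5' to 3'."""
    return seq.lower().translate(COMPLEMENT)[::-1]


for name, seq in PRIMERS.items():
    print(name, len(seq), f"GC={gc_fraction(seq):.2f}", reverse_complement(seq))
```

The reverse complement is the form in which a primer appears when it is located on the opposite strand of a target sequence.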



ii. Amplification of the 5' end

A modified version of Frohman's (1988) RACE protocol was utilized to amplify the 5' end of
the chicken GRF/PACAP cDNA. To amplify the 5' end, 1 µg poly A+ mRNA was mixed with
10pmol primer D, and 7 µl DEPC treated water to a final volume of 10 µl, heated at 65°C for 5min,
and then cooled on ice. Single stranded cDNA was synthesized with the above mRNA/primer
mixture, *d Superscript buffer, IpM dNTP, 10mM DTT, 5U RNA guard (Pharmacia), and 200 U
RT Superscript (BRL) to a final volume of 25 µl. The reaction was heated to 42°C for 1.25hr and
terminated by increasing the temperature to 95°C for 6min. The first strand synthesis was
concentrated to 12.5 µl, of which 10 µl was extended with dATP, 1 µl water and 1 µl TdT enzyme
(BRL). PCR conditions were identical to the above except for the use of primers D and E (oligo

iii. Amplification of GRF/PACAP mRNA splice variants 

Brains were removed from 25-day-old chickens and extracted in TriZol (BRL).
Complementary DNA was synthesized from 1 µg of total RNA using 200U avian reverse
transcriptase (H- RT Superscript, BRL), 10mM DTT, 0.5mM each dNTP, 50U RNA guard, 2 µM
primer E, and 1x H- RT buffer for a total reaction volume of 20 µl. The reaction proceeded for
90min at 41°C followed by 10 min at 90°C. PCR amplifications were done with 0.5 µl of newly
transcribed single stranded cDNA from each tissue, 5U Taq DNA polymerase, 1x Taq buffer
(Promega), 0.2mM each dNTP, 0.4 µM of primers A (5'-gagccccgcccgtgcttaccgcag) (Seq. I.D. No.
16) and D (Fig. 1), and 2.5mM MgCl2 in a 50 µl reaction for 35 cycles (94°C for 1 min, 55°C for
1.5 min, 72°C for 1.75 min). PCR reactions were purified through a 1.5% agarose gel. Bands were
isolated, cloned into pBluescript KS+ (Stratagene), electroporated into XL-1 competent cells, and
prepared for sequencing with an alkaline hydrolysis method (Birnboim 1983). Both strands were
sequenced with [α-35S]dATP using the USB Sequenase chain termination method (Sanger et al., 1977)
and CircumVent thermal cycle sequencing kit (New England Biolabs).

iv. Reverse transcriptase/PCR assay 

Brain, ovary/oviduct, testis, pituitary, heart, liver, kidney, crop, small intestine, large
intestine, eye, and muscle were removed from 25-day-old chickens and extracted in TriZol
(BRL). Complementary DNA was synthesized from 1 µg of total RNA using 200U avian reverse
transcriptase (H- RT Superscript, BRL), 10mM DTT, 0.5mM each dNTP, 50U RNA guard, 2 µM
primer E, and 1x H- RT buffer for a total reaction volume of 20 µl. The reaction proceeded for
90min at 41°C followed by 10 min at 90°C. PCR amplifications were done with 0.5 µl of newly
transcribed single stranded cDNA from each tissue, 5U Taq DNA polymerase, 1x Taq buffer
(Promega), 0.2mM each dNTP, 0.4 µM of primers A and D (Fig. 1), and 2.5mM MgCl2 in a 50 µl
reaction for 35 cycles (94°C for 1 min, 55°C for 1.5 min, 72°C for 1.75 min).



2. Genomic library screening 

A total of 10^6 pfu from the chicken genomic library (Stratagene) were screened with
the 294bp PCR cDNA fragment (primers D/F). Duplicate nylon membrane (BioRad) lifts were
prehybridized at 50°C in 6x SSC, 5x Denhardt's solution, 0.5% SDS and 30mg/ml sea urchin DNA
(blocking DNA) for 4 hours. The hybridization solution, consisting of 6x SSC, 0.5% SDS, and
100mg blocking DNA, was added to the [α-32P]dCTP (Dupont) labeled probe (2.4x10^7 cpm/ml) and
incubated at 50°C overnight. The membranes were washed under high stringency
(0.1x SSC/0.1% SDS) for 50 min at 50°C, then exposed to Kodak XAR-5 film for 7 days at -80°C.

Three additional rounds of screening were used to purify a positive clone that had been
isolated from the genomic library. The insert was excised from the phage DNA with SacI, purified
by agarose gel electrophoresis and subcloned into pBluescript KS (Stratagene) using T4 Ligase
(Pharmacia). Three of the four SacI subclones were shortened by nested deletions (double stranded
nested deletion kit, Pharmacia) on both strands followed by sequencing of the two strands, according
to the manufacturer's instructions.

3. Southern analysis 

Chicken liver DNA was ground and treated with proteinase K (Sigma) in buffer
(10mM Tris, pH 8; 100mM EDTA, pH 8; 0.5% SDS; 200 µg/ml proteinase K) overnight at 55°C.
The DNA was purified with three subsequent phenol:chloroform:isoamyl alcohol extractions
(24:24:1) and one chloroform:isoamyl (24:1) extraction; end-over-end mixing was carried out for 3
hours after each addition of fresh phenol-chloroform-isoamyl alcohol. The DNA was dialyzed
against TE (pH 8) overnight to remove the organic solvents. 10 µg (30 µl) DNA was digested with
EcoRI, SacI, HindIII, PvuII, or KpnI and electrophoresed through a 1% agarose gel, in
duplicate. The DNA was transferred according to the manufacturer's specification (BioRad) for the
alkaline Zeta-Probe GT membrane. Prehybridization was in 7% SDS, 0.25M NaHPO4, pH 7.2, 1mM EDTA
at 65°C for 15min. Hybridization was in fresh prehybridization solution plus the random primed
[α-32P]dCTP labeled probe for 17hr at 65°C. The hybridized membranes were rinsed under low
stringency (45°C) with 5% SDS, 40mM NaHPO4, 1mM EDTA and then washed for 45min with
fresh solution (45°C). The stringency was then raised (65°C) by washing twice for 45min at 65°C
with fresh 1% SDS, 40mM NaHPO4, 1mM EDTA. After
washing, the membranes were sealed in plastic and exposed 24hr to Kodak Biomax film with
intensifying screens at -80°C.

B. Results 

1. Gene organization 

A single band of 294bp resulted from the first DNA amplification of chicken RNA/cDNA.
This cDNA fragment was used to screen the chicken genomic library. One million clones were
screened to produce a single plaque that hybridized to the probe. The lambda clone of approximately
12500bp produced 4 fragments when digested with SacI (Figure 1). These smaller fragments were
purified and subcloned into pBluescript KS (subclones 1.8, 3.1, 3.2, and 4.4).

Subclones 1.8, 3.1 and 3.2 contained 6469bp of the chicken GRF/PACAP gene; clone 4.4
consisted of approximately 5kb of 3' flanking region and therefore was omitted. Subclone 1.8
(1682bp) contained exons 3, 4, and 5 encoding the cryptic peptide, GRF, and PACAP, respectively.
Exon 3 containing part of the cryptic peptide was 134bp in length and contained the nucleotide
reading frame that encodes a dibasic processing site (Lys-Arg) between the cryptic and GRF peptide.
Exon 4 has 96 nucleotides that code for the initial 32 amino acids of chicken GRF. The final
portion of GRF on exon 5 shows that chicken GRF is unique because it is 46 amino acids, the longest
known GRF. On exon 5 immediately downstream of the coding region of chicken GRF1-46 is the
coding region for PACAP1-38, separated from GRF by a Lys-Arg processing site. The chicken
PACAP1-38 is identical to the mammalian form except at position 2, which has an isoleucine
substituted for an alanine. Clone 3.1 (2160bp) contained exon 1 that encoded the 5'-untranslated
region (UTR) (194bp), an intron (142bp) and exon 2 that encoded the signal peptide and a portion of
the cryptic peptide (114bp). Clone 3.1 also contained 344bp of regulatory region. Within the 344bp
was a CAAT regulatory region at position 2974bp, and the TATAA at position 3002bp; these
nucleotides are consensus regulatory regions not found in other PACAP genes. Intron 2 of 1337bp was
the longest intron, and introns 3 of 178bp and 4 of 371bp contained an unusually high G/C content and
numerous (G)n repeats. Clone 3.2 (2627bp) was exclusively promoter.
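
The figures in this paragraph can be cross-checked with simple arithmetic, as in the following sketch. All numbers are taken from the text above; the variable names are illustrative only.

```python
# Sketch: arithmetic cross-checks on the reported subclone and exon sizes.
SUBCLONE_BP = {"1.8": 1682, "3.1": 2160, "3.2": 2627}

# The three sequenced subclones together should account for the reported gene length.
total_bp = sum(SUBCLONE_BP.values())

# Exon 4 length in nucleotides; each codon of three nucleotides encodes one residue.
EXON4_NT = 96
exon4_residues, leftover_nt = divmod(EXON4_NT, 3)

print("total sequenced length (bp):", total_bp)
print("exon 4 residues:", exon4_residues, "leftover nucleotides:", leftover_nt)
```

The sum agrees with the 6469bp stated for subclones 1.8, 3.1 and 3.2, and 96 nucleotides correspond to exactly 32 codons.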

2. Alternative Splicing 

To confirm intron-exon boundaries, brain mRNA/cDNA was screened using the PCR and
primers A and D. All intron-exon boundaries were confirmed; however, the boundary between exons
4 and 5 was found to vary among bands: the first boundary was at position 5703bp, the second
boundary had slid nine bases downstream to position 5712bp, and the third boundary shows that exon
4 is lacking altogether.



3. Tissue expression 

In juveniles at 25 days after hatching, chicken GRF/PACAP mRNA was detected not only in
the brain, but also in tissues external to the brain using a RT/PCR method. GRF/PACAP mRNA
expression was detected within the brain, ovary/oviduct and testis of the chicken. Expression was
not detected within the pituitary, heart, liver, kidney, crop, small intestine, large intestine, eye, or
the muscle. From the brain mRNA, two bands were amplified from the RT/PCR method. These
two bands, along with the single bands from the ovary/oviduct and testis, were purified and
sequenced to verify the PACAP sequence. The longest band in the brain and single band in gonads
contained all exons, whereas the shorter band in the brain lacked exon 4. The cDNA appeared to be
of good quality as determined by the PCR products obtained with tubulin primers.

4. Southern analysis 

Southern analysis of chicken genomic DNA using the 294bp PCR cDNA fragment as a DNA 
probe revealed two bands. All five genomic DNA restriction digests had two areas hybridizing to 
the cDNA probe. Both bands appeared when low and high stringency washes were applied to the 
membrane and no other bands appeared with low stringency washes. 

C. Discussion of results

1. Gene organization reveals two neuropeptides encoded in one gene

We have isolated from a chicken (c) genomic library a clone that encodes both a GRF peptide
and PACAP. This is the first report of an avian GRF. Both PACAP and GRF belong to the
glucagon superfamily in which the members have similar intron/exon organization and sequence
identity. For example, the amino acid identity between chicken GRF and PACAP is 30%. This
organization in which both peptides are encoded on the same gene is similar to that in fish (see U.S.
patent application serial No. 08/062,472), but unlike mammals, which have two genes encoding each
peptide separately.

The association of chicken PACAP (cPACAP) with members of a superfamily including
glucagon, secretin, GRF, and vasoactive intestinal peptide (VIP) is illustrated by a high sequence
identity between cPACAP and cVIP (Talbot et al., 1995). In comparing mRNAs, the nucleotides
encoding the cPACAP region have 80% identity with the nucleotides encoding the cVIP region.
This high degree of identity likely explains the observation of two bands hybridizing with the
cPACAP probe on Southern blots.

The nucleotides of the cPACAP coding region are 92% identical to the human PACAP gene.
The deduced cPACAP amino acid sequence is 97% identical to the human sequence with the only
change being at position 2 where an isoleucine is substituted for an alanine. In contrast, the chicken
GRF (cGRF) peptide has only 42% amino acid identity to human, 47% to rat and 76% to carp GRF
(Vaughan et al., 1992). This divergence among species is not surprising in view of the relatively
low sequence identity of 68% between human and rat GRF.
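
The 97% figure follows from a single substitution in a 38-residue peptide, as the sketch below shows. The function percent_identity is a hypothetical helper for two pre-aligned sequences of equal length, and the two strings are placeholders, not the real PACAP sequences; only the length (38) and the single difference at position 2 are taken from the text.

```python
# Sketch: percent identity between two pre-aligned, equal-length sequences.
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of aligned positions at which the two sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Placeholder 38-residue strings differing only at position 2 (assumption:
# these are NOT the real peptide sequences, just stand-ins of the right length).
placeholder_a = "X" + "A" + "X" * 36
placeholder_b = "X" + "I" + "X" * 36

identity = percent_identity(placeholder_a, placeholder_b)
print(f"{identity:.1f}% identity")  # 37 of 38 positions match
```

With real sequences, an alignment step would be needed first whenever the peptides differ in length, as the GRFs of different species do.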

2. Alternative splicing produces 3 different mRNAs

The chicken GRF/PACAP gene is composed of 5 exons. All 5 exon locations and
intron/exon boundaries were confirmed by isolating cDNA clones from the 5' and 3' RACE
reactions with adult brain cDNA (Figure 5). However, in sequences of the PCR fragments, we
observed that the intron/exon boundary between exons 4 and 5 has considerable variation (Figure 5).
The dominant boundary occurs at position 5703bp; the second boundary slides 9bp downstream to
position 5712; and the third boundary shows that exon 4 is lacking altogether. At both splice sites
nine bases apart, proper consensus splice sites exist. Therefore, the chicken GRF/PACAP mRNA
transcript has splice donor sites that encode a 43-amino-acid GRF. The acceptor site was also shown
to shift 9bp upstream to encode a GRF of 46 residues. The intron nucleotides at the 5' splice site of
intron 4, AG:GT(A) and the last 4 nucleotides of intron 4 at both 3' splice sites (NCAG:C), closely
match the splice site consensus sequences as found in vertebrates (Padgett et al., 1986; Green 1991).

This pattern of alternative splicing has not been reported for transcripts in this family of
peptides. The function of the alternative splicing is not known other than to encode two GRFs of
different length with, potentially, two different functions. Recent evidence suggests that human
GRF in the chicken may affect somatotroph differentiation in the embryonic chicken pituitary
(Porter et al., 1995) and the development of chick neuroblasts and their neurotransmitters (Kentori
and Vernadakis 1990). These effects on early brain development and GH-releasing somatotrophs in
the pituitary may reflect an early role of GRF1-43 and/or GRF1-46 in avian systems.

The third mRNA transcript, identified as a cDNA for chicken GRF/PACAP, lacks exon
4, which encodes GRF1-32. The critical part of the peptide is thought to be contained in the missing
sequence, as mammalian GRF1-29 is the core required for full biological activity (Ling et al., 1984).
The importance of GRF during development is implied by the absence of exon deletion in the embryo
prior to hatching. This deletion of exon four has been reported for cDNAs from three other family
members (Parker et al., 1993; Seugkwon et al., 1995; Talbot et al., 1995).

As a result of this alternative splicing, three cDNA sequences may be derived from the 
GRF/PACAP gene. They are as follows: 

1. "Full length" cDNA (Seq. I.D. No. 2) encoding GRF/PACAP precursor polypeptide (Seq. I.D.
No. 3) including 46 amino acid GRF peptide (Seq. I.D. No. 4) and 38 amino acid PACAP peptide
(Seq. I.D. No. 5).

2. "Alternatively spliced cDNA #1" (Seq. I.D. No. 6) encoding GRF/PACAP precursor
polypeptide (Seq. I.D. No. 7) including 43 amino acid GRF peptide (Seq. I.D. No. 8) and 38 amino
acid PACAP peptide (Seq. I.D. No. 5).

3. "Alternatively spliced cDNA #2" (Seq. I.D. No. 9) encoding GRF/PACAP precursor
polypeptide (Seq. I.D. No. 10) including the presumptively non-functional 14 amino acid truncated
GRF peptide and 38 amino acid PACAP peptide (Seq. I.D. No. 5).

The nucleotide sequences comprising the open reading frames of the 43 and 46 amino acid 
GRFs and the 38 amino acid PACAP are shown in Seq. I.D. Nos. 11, 12 and 13, respectively. 

III. Physiological activity of GRF/PACAP neuropeptides

In animal systems studied to date, purified GRF and PACAP have been shown to stimulate
the release of GH. For example, PACAP releases GH from mouse and rat clonal pituitary cell lines
(Propato-Mussafiri et al., 1992), and human GRF initiates the release of GH from chicken pituitary
cells both in vitro (Perez et al., 1987) and in vivo (Scanes and Harvey, 1984). The ability of
purified GRF and PACAP to stimulate GH release may therefore be regarded as a defining
functional characteristic of these peptides.

The ability of the chicken GRF and PACAP peptides to stimulate the release of GH from
chicken pituitary cells may readily be confirmed using the procedure described by Perez et al. (1987,
incorporated herein by reference). The assay procedure described by Perez et al. (1987) may also be
used to determine whether variant forms of the chicken GRF and PACAP peptides, produced as
described in Section V below, retain the ability to stimulate GH release.

IV. Preferred method for making GRF/PACAP genes and cDNAs

The foregoing discussion describes the original means by which the chicken GRF/PACAP
gene was obtained and also provides the nucleotide sequence of this gene and of cDNAs produced
from this gene. With the provision of this sequence information, the polymerase chain reaction
(PCR) may now be utilized in a more direct and simple method for producing the GRF/PACAP gene
and the disclosed cDNA sequences.

To amplify the cDNA sequences, total RNA is extracted from chicken brain cells as
described above. The extracted RNA is then used as a template for performing the reverse
transcription-polymerase chain reaction (RT-PCR) amplification of cDNA. Methods and conditions
for RT-PCR are described above and in Kawasaki et al. (1990). The selection of PCR primers will
be made according to the portions of the cDNA which are to be amplified. Primers may be chosen
to amplify small segments of a cDNA or the entire cDNA molecule. Variations in amplification
conditions may be required to accommodate primers of differing lengths; such considerations are
well known in the art and are discussed in Innis et al. (1990). For example, the open reading frame
of the chicken GRF cDNA molecule may be amplified using the following combination of primers:

primer 1 5' CACGCCGATGGGATCTTCAGCAAA 3' (Seq. I.D. No. 17) 
primer 2 5' CCCGACCCGCTTGGCCATCAGGGA 3' (Seq. I.D. No. 18)

These primers are illustrative only; it will be appreciated by one skilled in the art that many different 
primers may be derived from the provided cDNA sequence in order to amplify particular regions of 
the cDNAs encoding GRF, PACAP or the GRF/PACAP precursor. 
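
By way of illustration only, basic properties of candidate primers such as the two listed above can be checked with a few lines of Python. This is a minimal sketch: the helper names `gc_percent` and `reverse_complement` are hypothetical, and only the two primer sequences themselves are taken from this disclosure.

```python
# Illustrative helpers for inspecting PCR primers (function names are hypothetical).
PRIMER_1 = "CACGCCGATGGGATCTTCAGCAAA"  # Seq. I.D. No. 17
PRIMER_2 = "CCCGACCCGCTTGGCCATCAGGGA"  # Seq. I.D. No. 18

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


for name, primer in (("primer 1", PRIMER_1), ("primer 2", PRIMER_2)):
    print(f"{name}: {len(primer)} nt, {gc_percent(primer):.1f}% G+C")

# If primer 2 is the antisense primer (an assumption made here), its
# reverse complement is the matching stretch of the sense strand.
print(reverse_complement(PRIMER_2))
```

Primers of very different G+C content may call for different annealing conditions, which is one reason such a check is commonly made before amplification.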

Alternatively, the gene sequence encoding the GRF/PACAP precursor polypeptide (i.e. the
genomic sequence including introns) or pieces thereof may be obtained by amplification using
primers based on the presented gene sequence and genomic chicken DNA as a template.

V. Production of GRF/PACAP sequence variants 

It will be apparent to one skilled in the art that the biochemical activity of the chicken GRF
and PACAP peptides may be retained even though minor variations are made to the nucleotide
sequences encoding them. Thus, a nucleic acid sequence could be designed that encodes the
chicken GRF peptide, but which differs, by reason of the redundancy of the genetic code, from the
exact GRF cDNA sequence disclosed herein. Therefore, the degeneracy of the genetic code further
widens the scope of the present invention as it enables major variations in the nucleotide sequence of
a DNA molecule while maintaining the amino acid sequence of the encoded protein.
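
The extent of this redundancy can be illustrated with a short Python sketch. It builds the standard genetic code, translates a stretch of coding sequence, lists the synonymous codons for a given triplet, and counts how many different DNA sequences encode the same peptide. The example input is the sequence of primer 1 (Seq. I.D. No. 17); treating it as the first eight codons of the GRF open reading frame is an assumption made here for illustration, and all function names are hypothetical.

```python
from itertools import product
from math import prod

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Standard genetic code: codon -> one-letter amino acid ("*" marks a stop codon).
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, reading complete codons from position 1."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def synonymous_codons(codon: str) -> list[str]:
    """Return the other codons that encode the same amino acid as `codon`."""
    aa = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa and c != codon)


def count_encodings(peptide: str) -> int:
    """Count the distinct DNA sequences that encode `peptide`."""
    return prod(
        sum(1 for a in CODON_TABLE.values() if a == aa) for aa in peptide
    )


# Assumption: primer 1 corresponds to the first eight codons of the GRF peptide.
GRF_5PRIME = "CACGCCGATGGGATCTTCAGCAAA"

peptide = translate(GRF_5PRIME)
print(peptide)                    # HADGIFSK
print(synonymous_codons("AGC"))   # the five alternative serine codons
print(count_encodings(peptide))   # 4608
```

Even for this eight-residue stretch, several thousand distinct nucleotide sequences encode the identical peptide.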

For example, the seventh amino acid residue in the chicken GRF peptide is serine. This is 
encoded in the GRF gene by the nucleotide codon triplet AGC. Because of the degeneracy of the 
genetic code, five other nucleotide codon triplets-TCT, TCA, TCG, TCC, and AGT-also code for 



WO 98/32857 B PCT/CA98/00033 

serine. Accordingly, the nucleotide sequence of the GRF gene or cDNA could be changed at this
position to any one of these five codons without affecting the amino acid composition of the encoded
GRF peptide or the functional characteristics of the peptide. The genetic code and variations in
nucleotide codons for particular amino acids are presented in Tables 1 and 2. Based upon the
degeneracy of the genetic code, variant DNA molecules may be derived from the DNA sequences
disclosed herein using standard DNA mutagenesis techniques, or by direct chemical synthesis of
DNA sequences.

"Stop (och)" stands for the ochre termination triplet, and "Stop (amb)" for the amber. ATG is the
most common initiator codon; GTG usually codes for valine, but it can also code for methionine to
initiate an mRNA chain.

TABLE 2

The Degeneracy of the Genetic Code

Additionally, standard mutagenesis techniques may be used to produce peptides which
vary in amino acid sequence from the disclosed GRF and PACAP peptides. Such variant peptides
include those with variations in amino acid sequence including minor deletions, additions and
substitutions.

While the site for introducing an amino acid sequence variation is predetermined, the 
mutation per se need not be predetermined. For example, in order to optimize the performance of a 
mutation at a given site, random mutagenesis may be conducted at the target codon or region and the 
expressed protein variants screened for the optimal combination of desired activity. Techniques for 
making substitution mutations at predetermined sites in DNA having a known sequence as described 
above are well known. 

In order to maintain the ability of the GRF/PACAP peptides to stimulate GH release,
preferred peptide variants will differ by only a small number of amino acids from the GRF and 
PACAP peptide sequences disclosed herein. Preferably, such variants will be amino acid 
substitutions of single residues. Substitutional variants are those in which at least one residue in the 
amino acid sequence has been removed and a different residue inserted in its place. Such 
substitutions generally are made in accordance with the following Table 3 when it is desired to finely 
modulate the characteristics of the protein. Table 3 shows amino acids which may be substituted for 
an original amino acid in a protein and which are regarded as conservative substitutions.

TABLE 3

Substantial changes in immunological and functional identity are made by selecting
substitutions that are less conservative than those in Table 3, i.e., selecting residues that differ more
significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of
the substitution, for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of
the molecule at the target site, or (c) the bulk of the side chain. The substitutions which in general
are expected to produce the greatest changes in protein properties will be those in which (a) a
hydrophilic residue, e.g., seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g.,
leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any
other residue; (c) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is
substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having
a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g.,
glycine.
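
The four classes of substitution (a) to (d) listed above can be expressed as a simple screening function. The following Python sketch is illustrative only: the residue sets contain just the examples named in the text (one-letter codes), and the function name `disruptive_classes` is hypothetical. A fuller implementation would need complete residue groupings, which are not specified here.

```python
# Residue groups limited to the examples given in the text (one-letter codes).
HYDROPHILIC = {"S", "T"}                  # seryl, threonyl
HYDROPHOBIC = {"L", "I", "F", "V", "A"}   # leucyl, isoleucyl, phenylalanyl, valyl, alanyl
CYS_OR_PRO = {"C", "P"}
ELECTROPOSITIVE = {"K", "R", "H"}         # lysyl, arginyl, histidyl
ELECTRONEGATIVE = {"E", "D"}              # glutamyl, aspartyl
BULKY = {"F"}                             # phenylalanine
NO_SIDE_CHAIN = {"G"}                     # glycine


def _swaps(a: str, b: str, group_1: set, group_2: set) -> bool:
    """True if one residue is in group_1 and the other in group_2 ("for (or by)")."""
    return (a in group_1 and b in group_2) or (a in group_2 and b in group_1)


def disruptive_classes(original: str, replacement: str) -> list[str]:
    """Return the classes (a)-(d) that a single-residue substitution falls into."""
    if original == replacement:
        return []
    classes = []
    if _swaps(original, replacement, HYDROPHILIC, HYDROPHOBIC):
        classes.append("a")
    if original in CYS_OR_PRO or replacement in CYS_OR_PRO:
        classes.append("b")
    if _swaps(original, replacement, ELECTROPOSITIVE, ELECTRONEGATIVE):
        classes.append("c")
    if _swaps(original, replacement, BULKY, NO_SIDE_CHAIN):
        classes.append("d")
    return classes


print(disruptive_classes("S", "L"))  # ['a']
print(disruptive_classes("K", "E"))  # ['c']
print(disruptive_classes("S", "T"))  # []
```

A substitution that returns an empty list is not thereby shown to be conservative; it simply falls outside the example classes named above.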

The present invention thus encompasses not only the precise GRF/PACAP peptides described
herein but also peptides which are derived from those disclosed and which retain the ability to
stimulate the release of GH from chicken pituitary cells in vitro. Similarly the scope of the invention
is not limited to the precise nucleic acid sequences disclosed.

VI. Construction of recombinant vectors for expression of GRF/PACAP peptides in
heterologous systems

With the provision of the nucleotide sequence for the chicken GRF/PACAP gene and 
nucleotide sequences encoding the individual chicken GRF and PACAP peptides, this invention 
enables the construction of recombinant cloning vectors for expressing any combination of the 
GRF/PACAP precursor polypeptide, the GRF peptide or the PACAP peptide (as well as variants on 
these sequences, as described in the preceding section). For example, the nucleotide sequence 
depicted in Seq. I.D. No. 2 may be selected for expression of the full length cDNA encoding the 
GRF1-46/PACAP1-38 precursor polypeptide, whereas the sequence shown in Seq. I.D. No. 12 may be
selected for expression of GRF1-46 alone.

The expression of these open reading frames (ORFs) in heterologous cell systems involves the 
introduction of the ORF into a vector (such as a plasmid), in such a way that the ORF is operably 
linked to regulatory sequences to direct transcription of the ORF. The recombinant vector is 
introduced into the selected host cell, which is then grown under conditions which support the 
expression of the ORF and production of the peptide sequence. Methods for expressing proteins by 
recombinant means in compatible prokaryotic or eukaryotic host cells are well known in the art and 
are discussed, for example, in Sambrook et al. (1989) and in Ausubel et al. (1987). 

The most commonly used prokaryotic hosts are strains of Escherichia coli, although other 
prokaryotes, such as Bacillus subtilis or Pseudomonas may also be used, as is well known in the art. 
For expression in a bacterial host, the selected ORF is ligated into bacterial expression vectors.
Methods for expressing large amounts of protein from a cloned gene introduced into Escherichia coli
(E. coli) may be utilized for the purification of the peptide. Methods and plasmid vectors for
producing fusion proteins and intact native proteins in bacteria are described in Sambrook et al. 
(1989) (ch. 17). Such fusion proteins may be made in large amounts, are easy to purify, and can be 
used to produce antibodies. Native proteins can be produced in bacteria by placing a strong, 
regulated promoter and an efficient ribosome binding site upstream of the cloned gene. If low levels 
of protein are produced, additional steps may be taken to increase protein production; if high levels 
of protein are produced, purification is relatively easy. 

Often, proteins expressed at high levels are found in insoluble inclusion bodies. Methods for 
extracting proteins from these aggregates are described in ch. 17 of Sambrook et al. (1989). Vector 
systems suitable for the expression of lacZ fusion genes include the pUR series of vectors (Ruther
et al. (1983)), pEX1-3 (Stanley and Luzio (1984)) and pMR100 (Gray et al. (1982)). Vectors
suitable for the production of intact native proteins include pKC30 (Shimatake and Rosenberg
(1981)), pKK177-3 (Amann and Brosius (1985)) and pET-3 (Studier and Moffatt (1986)). Fusion
proteins may be isolated from protein gels, lyophilized, ground into a powder and used as antigen
preparations.

Mammalian or other eukaryotic host cells, such as those of yeast, filamentous fungi, plant,
insect, amphibian or avian species, may also be used for protein expression, as is well known in the
art. Examples of commonly used mammalian host cell lines are VERO and HeLa cells, Chinese
hamster ovary (CHO) cells, and WI38, BHK, and COS cell lines, although it will be appreciated by
the skilled practitioner that other prokaryotic and eukaryotic cells and cell lines may be appropriate
for a variety of purposes, e.g., to provide higher expression, desirable glycosylation patterns, or
other features.

For expression in mammalian cells, the ORF sequence may be ligated to heterologous
promoters, such as the SV40 promoter in the pSV2 vector (Mulligan and Berg, 1981), and
introduced into cells, such as monkey COS-1 cells (Gluzman, 1981), to achieve transient or
long-term expression. To achieve this, the ORF or a mini gene (a cDNA with an intron and its own
promoter) is introduced into eukaryotic expression vectors by conventional techniques. These
vectors are designed to permit the transcription of a cDNA in eukaryotic cells by providing
regulatory sequences that initiate and enhance the transcription of the cDNA. Vectors containing the
promoter and enhancer regions of the SV40 or long terminal repeat (LTR) of the Rous Sarcoma
virus and polyadenylation and splicing signal from SV40 are readily available (Mulligan et al., 1981;
Gorman et al., 1982). The level of expression of the cDNA can be manipulated with this type of
vector, either by using promoters that have different activities (for example, the baculovirus pAC373
can express cDNAs at high levels in S. frugiperda cells (Summers and Smith, 1985)) or by using
vectors that contain promoters amenable to modulation, for example, the glucocorticoid-responsive
promoter from the mouse mammary tumor virus (Lee et al., 1982).

In addition, some vectors contain selectable markers such as the gpt (Mulligan and Berg,
1981) or neo (Southern and Berg, 1982) bacterial genes. These selectable markers permit selection
of transfected cells that exhibit stable, long-term expression of the vectors (and therefore the
introduced ORF). The vectors can be maintained in the cells as episomal, freely replicating entities
by using regulatory elements of viruses such as papilloma (Sarver et al., 1981) or Epstein-Barr
(Sugden et al., 1985). Alternatively, one can also produce cell lines that have integrated the vector
into genomic DNA. Both of these types of cell lines produce the gene product on a continuous basis.
One can also produce cell lines that have amplified the number of copies of the vector (and therefore
of the ORF as well) to create cell lines that can produce high levels of the gene product (Alt et al.,
1978).

The transfer of DNA into eukaryotic, in particular human or other mammalian cells, is now a
conventional technique. The vectors are introduced into the recipient cells as pure DNA
(transfection) by, for example, precipitation with calcium phosphate (Graham and vander Eb, 1973)
or strontium phosphate (Brash et al., 1987), electroporation (Neumann et al., 1982), lipofection
(Felgner et al., 1987), DEAE dextran (McCuthan et al., 1968), microinjection (Mueller et al.,
1978), protoplast fusion (Schafner, 1980), or pellet guns (Klein et al., 1987). Alternatively, the
cDNA can be introduced by infection with virus vectors. Systems are developed that use, for
example, retroviruses (Bernstein et al., 1985), adenoviruses (Ahmad et al., 1986), or Herpes virus
(Spaete et al., 1982).

This invention encompasses, in part, recombinant cloning vectors encoding the GRF/PACAP
sequence, or portions thereof. The GRF/PACAP sequence is operatively linked in the vector to an
expression control sequence in the recombinant DNA molecule so that the GRF/PACAP polypeptide, 
or a portion thereof, can be expressed in a host cell. The expression control sequence may be 
selected from the group consisting of sequences that control the expression of genes of prokaryotic or 
eukaryotic cells and their viruses and combinations thereof. The expression control sequence may be 
specifically selected from a wide group of characterized regulatory sequences, including the lac 
system, the trp system, the tac system, the trc system, major operator and promoter regions of phage 
lambda, the control region of fd coat protein, the early and late promoters of SV40, promoters 
derived from polyoma, adenovirus, retrovirus, baculovirus and simian virus, the promoter for 
3-phosphoglycerate kinase, the promoters of yeast acid phosphatase, the promoter of the yeast alpha- 
mating factors and combinations thereof. 

Another aspect of the present invention is a host cell containing a recombinant vector which 
encodes the GRF/PACAP precursor polypeptide or the GRF or PACAP peptides. 

VII. Formulation of purified peptides for administration to poultry

The chicken GRF/PACAP precursor polypeptide or the individual GRF or PACAP peptides 
may be purified from host cells as described above. Alternatively, these peptides may be chemically 
synthesized using common peptide synthesis techniques. An exemplary peptide synthesis technique 
is described in U.S. Patent No. 5,326,860, which is incorporated herein by reference. 

Once purified, these peptides may be incorporated into slow-release formulations for 
administration to chicks. Such formulations include the purified peptide and a biocompatible matrix, 
such as cholesterol. Slow release formulations may take the form of pellets, which can be 
administered subcutaneously, or may be preparations suitable for injection. The dosage of peptide 
administered will vary with the predicted speed of release in the body, but will be in the 
approximate range of 1 µg - 100 mg for a 2 kg chicken. Thus, for example, a pellet for subcutaneous
administration may be prepared by combining 30 mg of powdered cholesterol with 1 mg of the
selected peptide and compressing the formulation using a standard pellet maker. The peptides may be 
pelleted alone or in combination (e.g. pellets may be made using just purified GRF, or with GRF 
combined with PACAP). Formulation of peptides into slow release preparations may be performed 
according to standard techniques, or may be performed by a commercial supplier of such materials. 
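
The arithmetic behind the example pellet can be set out in a few lines of Python. This is a sketch for illustration only: the constants are the figures quoted above (30 mg cholesterol, 1 mg peptide, a 2 kg bird, and a dosage range read as 1 µg to 100 mg), and the function names are hypothetical.

```python
# Figures quoted in the text; the names are hypothetical.
MATRIX_MG = 30.0                  # powdered cholesterol per pellet
PEPTIDE_MG = 1.0                  # selected peptide per pellet
BIRD_MASS_KG = 2.0                # reference chicken
DOSE_RANGE_MG = (0.001, 100.0)    # 1 microgram to 100 mg for a 2 kg chicken


def peptide_fraction(peptide_mg: float, matrix_mg: float) -> float:
    """Mass fraction of peptide in the compressed pellet."""
    return peptide_mg / (peptide_mg + matrix_mg)


def dose_per_kg(peptide_mg: float, body_mass_kg: float) -> float:
    """Peptide dose expressed in mg per kg of body mass."""
    return peptide_mg / body_mass_kg


def within_stated_range(peptide_mg: float) -> bool:
    """Check a dose for a 2 kg bird against the range given in the text."""
    low, high = DOSE_RANGE_MG
    return low <= peptide_mg <= high


print(f"{100 * peptide_fraction(PEPTIDE_MG, MATRIX_MG):.1f}% peptide by mass")
print(f"{dose_per_kg(PEPTIDE_MG, BIRD_MASS_KG):.2f} mg/kg")
print(within_stated_range(PEPTIDE_MG))
```

For the example pellet this gives roughly 3.2% peptide by mass and 0.5 mg per kg for a 2 kg bird, which lies inside the stated range.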

VIII. Production of antibodies to GRF and PACAP

Monoclonal antibodies may be produced which bind the chicken GRF/PACAP precursor 
polypeptide or the individual GRF or PACAP peptides (referred to as the "target peptide"). 
Optimally, antibodies raised against any of the peptides would specifically detect the target peptide 
against which they were raised. That is, such antibodies would recognize and bind that peptide and
would not substantially recognize or bind to other proteins found in chicken cells. The determination
that an antibody specifically detects a particular peptide is made by any one of a number of standard
immunoassay methods; for instance, the Western blotting technique (Sambrook et al., 1989). For
example, to determine that a given antibody preparation (such as one produced in a mouse)
specifically detects the chicken GRF peptide by Western blotting, total cellular protein is extracted
from chicken cells (for example, gonad cells) and electrophoresed on a sodium dodecyl
sulfate-polyacrylamide gel. The proteins are then transferred to a membrane (for example, nitrocellulose)
by Western blotting, and the antibody preparation is incubated with the membrane. After washing
the membrane to remove non-specifically bound antibodies, the presence of specifically bound
antibodies is detected by the use of an anti-mouse antibody conjugated to an enzyme such as alkaline
phosphatase; application of the substrate 5-bromo-4-chloro-3-indolyl phosphate/nitro blue tetrazolium
results in the production of a dense blue compound by immuno-localized alkaline phosphatase.
Antibodies which specifically detect the GRF peptide will, by this technique, be shown to bind to the
GRF peptide band (which will be localized at a given position on the gel determined by its molecular
weight). Non-specific binding of the antibody to other proteins may occur and may be detectable as
a weak signal on the Western blot. The non-specific nature of this binding will be recognized by one
skilled in the art by the weak signal obtained on the Western blot relative to the strong primary
signal arising from the specific antibody-GRF peptide binding.

Substantially pure target peptide suitable for use as an immunogen is isolated from the
transfected or transformed cells as described above. Concentration of the target peptide in the final
preparation is adjusted, for example, by concentration on an Amicon filter device, to the level of a
few milligrams per milliliter. Monoclonal antibody to the target peptide can then be prepared as
follows:

Monoclonal antibody to epitopes of the target peptide identified and isolated as described can
be prepared from murine hybridomas according to the classical method of Kohler and Milstein
(1975) or derivative methods thereof. Briefly, a mouse is repetitively inoculated with a few
micrograms of the target peptide over a period of a few weeks. The mouse is then sacrificed, and
the antibody-producing cells of the spleen isolated. The spleen cells are fused by means of
polyethylene glycol with mouse myeloma cells, and the excess unfused cells destroyed by growth of
the system on selective media comprising aminopterin (HAT media). The successfully fused cells
are diluted and aliquots of the dilution placed in wells of a microtiter plate where growth of the
culture is continued. Antibody-producing clones are identified by detection of antibody in the
supernatant fluid of the wells by immunoassay procedures, such as ELISA, as originally described
by Engvall (1980), and derivative methods thereof. Selected positive clones can be expanded and
their monoclonal antibody product harvested for use. Detailed procedures for monoclonal antibody
production are described in Harlow and Lane (1988).

Monoclonal antibodies raised against the chicken PACAP, GRF or PACAP/GRF precursor
peptides are useful in purifying these peptides and in detecting the presence of these peptides using
standard biochemical techniques (such as radioimmunoassay, RIA). For example, the antibodies
may be used to quantify levels of PACAP or GRF in poultry to which peptide pellets have been
administered.

IX. Introduction of GRF/PACAP gene into poultry 

The present invention also facilitates the production of transgenic poultry, expressing elevated 
levels of GRF and/or PACAP. A vector expressing the desired peptide may be produced as 
described in Section VI above. It may be desirable to produce a construct expressing the
GRF/PACAP polypeptide (or the individual GRF or PACAP polypeptides) under the control of the 
native GRF/PACAP gene promoter, such that the introduced construct expresses the encoded peptide 
in the same cells and at the same developmental stages as the native peptide is expressed. This may 
be achieved by operably linking the 5' promoter region of the GRF/PACAP gene (identified as 
nucleotide numbers 1 to 3074 of Seq. I.D. No. 1) to the selected ORF. While nucleotides 1 to 3074
are known to include regions controlling the expression of the native GRF/PACAP gene, one of skill 
in the art will also recognize that less than this entire sequence may provide satisfactory regulation of 
gene expression. Similarly all or part of the nucleotide sequence located 3' of the native 
GRF/PACAP gene (represented as nucleotide numbers 6201 to 6529 of Seq. I.D. No. 1) may be 
operably linked to the 3' end of the selected ORF. 
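
When these regions are extracted from a sequence file, the 1-based, inclusive nucleotide numbers used above need to be converted to the 0-based, end-exclusive slices used by most programming languages. The following Python sketch shows one way to do this. The function names are hypothetical, and the placeholder string of `N` characters merely stands in for Seq. I.D. No. 1, which is not reproduced here; its length of 6529 is an assumption based on the last nucleotide number quoted.

```python
PROMOTER_REGION = (1, 3074)        # 5' promoter region, nucleotide numbers from the text
DOWNSTREAM_REGION = (6201, 6529)   # sequence located 3' of the gene


def region_length(start: int, end: int) -> int:
    """Length of a region given 1-based, inclusive coordinates."""
    return end - start + 1


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return nucleotides start..end (1-based, inclusive) of `sequence`."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Placeholder only: a real analysis would load the actual gene sequence.
placeholder_gene = "N" * 6529

promoter = extract_region(placeholder_gene, *PROMOTER_REGION)
downstream = extract_region(placeholder_gene, *DOWNSTREAM_REGION)
print(len(promoter), len(downstream))
```

With these coordinates the promoter region is 3074 nucleotides long and the 3' region is 329 nucleotides long.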

Higher or constitutive levels of GRF or PACAP expression may be obtained by using 
GRF/PACAP constructs in which the open reading frame is operably linked to a promoter known to
direct high level or constitutive expression of downstream gene sequences. Promoter sequences 
specific for particular tissues (e.g. brain or gonads) or particular developmental stages may also be 
employed. 

These recombinant vectors can then be introduced into chickens. Standard methods of 
producing transgenic fish are not suitable for use in chickens, in part because chicken ova are nearly 
impossible to obtain as they are fertilized inside the hen and begin to divide rapidly long before they 
are laid as eggs. Recently, new approaches have been developed, including embryonic stem cell 
methods (Pain et al., 1996) and primordial germ cell (PGC) isolation (Chang et al., 1992). PGCs 
are the precursors to ova and sperm; they are formed in the hypoblast, then move through the blood 
to the genital ridge where they settle and remain in the gonads. PGCs may thus be found in the 
blood of chicken embryos, and may be separated from the blood cells using a Ficoll gradient. Gene 
constructs may be introduced into the PGCs using a lipid carrier and the modified PGCs injected into
host embryos. The host embryos are then incubated, hatched and allowed to mature to reproductive
age. These chickens will have extra copies of the gene only in their eggs and sperm, and may be
bred conventionally to produce chicks which have the introduced genetic construct in all of their cells
(the presence of the construct can readily be detected using standard PCR techniques). Successful
transfer of PGCs from one chicken to another has already been achieved with a hatch rate of 18-30%
(Naito et al., 1994).

Accordingly, the present invention includes recombinant DNA molecules that include
sequences encoding chicken GRF or PACAP peptides or a GRF/PACAP precursor polypeptide, as
well as transgenic non-human animals wherein the genome of these animals includes such a
recombinant DNA molecule.

X. Cloning of related genes from other species 

This invention provides the nucleotide sequence of the chicken GRF/PACAP gene, as well as 
regulatory sequences associated with this gene. These nucleotide sequences may now be used to 
obtain corresponding and related sequences from other species. For example, the chicken
GRF/PACAP gene sequence may be used as a hybridization probe to isolate corresponding
neuropeptide genes from other avian species, such as turkey. Related avian neuropeptide genes may
alternatively be obtained using primers derived from the sequences provided herein, in conjunction
with standard gene amplification techniques. Hybridization probes and amplification primers useful
in such techniques and derived from the disclosed nucleotide sequences are part of the present
invention.

By way of example, related avian neuropeptide genes may be obtained by creating a library
of avian cDNA or genomic DNA in a plasmid, bacteriophage or phagemid vector and screening this
library with a hybridization probe using standard colony hybridization techniques. The hybridization
probe consists of an oligonucleotide derived from the chicken GRF/PACAP gene sequence labeled
with a suitable marker to enable detection of hybridizing clones. Suitable markers include
radionuclides, such as P-32, and non-radioactive markers, such as biotin. Methods for constructing
suitable libraries, production and labelling of oligonucleotide probes and colony hybridization are
standard laboratory procedures and are described in standard laboratory manuals such as Sambrook
et al. (1989) and Ausubel et al. (1987).

Having identified a clone that hybridizes with the oligonucleotide, the clone is sequenced
using standard methods such as described in Chapter 13 of Sambrook et al. (1989). Determination of
the translation initiation point of the DNA sequence enables the open reading frame of the cDNA to
be determined.

An alternative approach to cloning genes homologous to the disclosed chicken nucleotide
sequences is the use of the polymerase chain reaction (PCR). In particular, the inverse polymerase
chain reaction (IPCR) is useful to isolate DNA sequences flanking a known sequence. Methods for
amplification of flanking sequences by IPCR are described in Chapter 27 of Innis et al. (1990).

Accordingly, within the scope of this invention are small DNA molecules which are derived
from the disclosed chicken nucleotide sequence. Such small DNA molecules include
oligonucleotides suitable for use as hybridization probes or polymerase chain reaction (PCR)
primers. For use in gene amplification techniques, these oligonucleotides will preferably comprise a
contiguous stretch of at least 10-15 nucleotides of the chicken sequences shown in Seq. I.D. No. 1 or
the salmon sequence shown in Seq. I.D. Nos. 8 or 9. For use as hybridization probes, these
oligonucleotides will preferably comprise a contiguous stretch of at least 20-30 nucleotides of these
sequences.
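
By way of illustration, candidate oligonucleotides of a chosen minimum length can be enumerated from a known sequence with a simple sliding window, as in the following Python sketch. The function names are hypothetical, and the 24-nucleotide sequence of primer 1 (Seq. I.D. No. 17) is used only as a short stand-in for a longer disclosed sequence.

```python
def contiguous_oligos(sequence: str, length: int):
    """Yield (1-based start position, oligo) for every contiguous stretch of `length`."""
    for start in range(len(sequence) - length + 1):
        yield start + 1, sequence[start:start + length]


def is_contiguous_stretch(oligo: str, sequence: str) -> bool:
    """True if `oligo` occurs as an uninterrupted stretch of `sequence`."""
    return oligo.upper() in sequence.upper()


# Stand-in for a longer disclosed sequence (assumption for illustration).
SOURCE = "CACGCCGATGGGATCTTCAGCAAA"

primer_candidates = list(contiguous_oligos(SOURCE, 15))  # amplification primers
probe_candidates = list(contiguous_oligos(SOURCE, 20))   # hybridization probes

print(len(primer_candidates), primer_candidates[0])
print(len(probe_candidates))
print(is_contiguous_stretch("GATGGGATCTTCAGC", SOURCE))
```

In practice the candidates would then be filtered on further criteria, such as G+C content or self-complementarity, which are outside the scope of this sketch.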

Also encompassed in the present invention are nucleotide sequences which are homologous to
the chicken GRF/PACAP precursor polypeptide gene and which hybridize to this sequence, or a
fragment thereof, under stringent hybridization conditions. Hybridization conditions resulting in
particular degrees of stringency will vary depending upon the nature of the hybridization method of
choice and the composition and length of the hybridizing DNA used. Generally, the temperature of
hybridization and the ionic strength (especially the Na+ concentration) of the hybridization buffer
will determine the stringency of hybridization. Calculations regarding hybridization conditions
required for attaining particular degrees of stringency are discussed by Sambrook et al. (1989),
chapters 9 and 11, herein incorporated by reference. By way of illustration only, a hybridization
experiment may be performed by hybridization of a DNA molecule to a target DNA molecule which
has been electrophoresed in an agarose gel and transferred to a nitrocellulose membrane by Southern
blotting (Southern, 1975), a technique well known in the art and described in Sambrook et al.
(1989). Hybridization with a target probe labeled with [32P]-dCTP is generally carried out in a
solution of high ionic strength such as 6xSSC at a temperature that is 20-25°C below the melting
temperature, Tm, described below. For such Southern hybridization experiments where the target
DNA molecule on the Southern blot contains 10 ng of DNA or more, hybridization is typically
carried out for 6-8 hours using 1-2 ng/ml radiolabeled probe (of specific activity equal to
10^9 CPM/µg or greater). Following hybridization, the nitrocellulose filter is washed to remove
background hybridization. The washing conditions should be as stringent as possible to remove
background hybridization but to retain a specific hybridization signal. The term Tm represents the
temperature above which, under the prevailing ionic conditions, the radiolabeled probe molecule will
not hybridize to its target DNA molecule. The Tm of such a hybrid molecule may be estimated from
the following equation (Bolton and McCarthy, 1962):

    Tm = 81.5°C - 16.6(log10[Na+]) + 0.41(%G+C) - 0.63(% formamide) - (600/l)

Where l = the length of the hybrid in base pairs.

This equation is valid for concentrations of Na+ in the range of 0.01 M to 0.4 M, and it is less
accurate for calculations of Tm in solutions of higher [Na+]. The equation is also primarily valid for
DNAs whose G+C content is in the range of 30% to 75%, and it applies to hybrids greater than
100 nucleotides in length (the behavior of oligonucleotide probes is described in detail in Ch. 11 of
Sambrook et al., 1989).

Thus, by way of example, for a 150 base pair DNA probe derived from the open reading frame of
the chicken GRF/PACAP precursor polypeptide gene (with a hypothetical %GC = 45%), a
calculation of hybridization conditions required to give particular stringencies may be made as
follows:
For this example, it is assumed that the filter will be washed in 0.3xSSC solution following
hybridization, thereby:

[Na+] = 0.045 M
%GC = 45%
Formamide concentration = 0
l = 150 base pairs

Tm = 81.5 + 16(log10[Na+]) + (0.41 x 45) - (600/150)
and so Tm = 74.4°C.
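
By way of illustration only, the estimate above can be reproduced numerically. The following minimal Python sketch assumes that the salt term enters as a coefficient multiplied by log10[Na+] (a negative quantity for sub-molar salt, so that Tm falls as the salt concentration falls), which is the reading that yields the 74.4°C figure. The function name melting_temperature and the parameter salt_coeff are hypothetical names introduced here for illustration; the worked example uses a coefficient of 16, whereas the general equation uses 16.6, so the coefficient is left adjustable.

```python
import math


def melting_temperature(na_molar, gc_percent, formamide_percent, length_bp,
                        salt_coeff=16.6):
    """Estimate Tm in degrees C for a DNA hybrid longer than about 100 bp.

    salt_coeff is an illustrative parameter: 16.6 matches the general
    equation, 16.0 matches the rounded value used in the worked example.
    """
    return (81.5
            + salt_coeff * math.log10(na_molar)   # log10 is negative below 1 M
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / length_bp)


# Worked example: 0.3xSSC wash ([Na+] = 0.045 M), 45% GC, no formamide, 150 bp.
tm_example = melting_temperature(0.045, 45, 0, 150, salt_coeff=16.0)
tm_general = melting_temperature(0.045, 45, 0, 150)
print(f"{tm_example:.1f}")  # 74.4
print(f"{tm_general:.1f}")  # 73.6
```

With the coefficient of 16 used in the worked example the sketch returns 74.4°C; with 16.6 it returns approximately 73.6°C, a difference of less than one degree.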

The Tm of double-stranded DNA decreases by 1-1.5°C with every 1% decrease in homology
(Bonner et al., 1973). Therefore, for this given example, washing the filter in 0.3xSSC at
59.4-64.4°C will produce a stringency of hybridization equivalent to 90%; that is, oligonucleotides
with more than 10% sequence variation relative to the target sequence will not hybridize (such
hybridization conditions may be referred to as "conditions of 90% stringency"). Alternatively,
washing the hybridized filter in 0.3xSSC at a temperature of 65.4-68.4°C will yield a hybridization
stringency of 94% (conditions of 94% stringency); that is, oligonucleotides with more than 6%
sequence variation relative to the target sequence will not hybridize. The above example is given
entirely by way of theoretical illustration. One skilled in the art will appreciate that other
hybridization techniques may be utilized and that variations in experimental conditions will
necessitate alternative calculations for stringency.
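
The wash temperatures quoted above follow directly from the Tm and the 1-1.5°C decrease per 1% mismatch. The short Python sketch below, offered only as a hedged illustration, computes the wash temperature window for a chosen stringency; the function name wash_temperature_range and its parameters are hypothetical and are not taken from the cited references.

```python
def wash_temperature_range(tm_celsius, stringency_percent,
                           drop_per_percent=(1.0, 1.5)):
    """Return (low, high) wash temperatures in degrees C for a stringency.

    Stringency of N% means hybrids with more than (100 - N)% mismatch
    should not survive the wash. Tm is assumed to fall by between
    drop_per_percent[0] and drop_per_percent[1] degrees per 1% mismatch.
    """
    mismatch = 100.0 - stringency_percent
    low_drop, high_drop = drop_per_percent
    return (tm_celsius - high_drop * mismatch,
            tm_celsius - low_drop * mismatch)


# Values from the worked example (Tm = 74.4 degrees C in 0.3xSSC).
for stringency in (90, 94, 75):
    low, high = wash_temperature_range(74.4, stringency)
    print(f"{stringency}% stringency: {low:.1f}-{high:.1f} C")
```

For the example probe this gives 59.4-64.4°C at 90% stringency and 65.4-68.4°C at 94% stringency, in agreement with the figures above; the same arithmetic gives 36.9-49.4°C for conditions of 75% stringency.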
The present invention encompasses nucleic acid molecules which hybridize to the specific
nucleic acid molecules presented in the accompanying sequence listing under conditions of high
stringency. In preferred embodiments of the present invention, stringent conditions are those under
which DNA molecules with more than 25% sequence variation (also termed "mismatch") will not
hybridize. Such hybridization conditions thus represent conditions of 75% stringency. In a more
preferred embodiment, stringent conditions are those under which DNA molecules with more than
10% mismatch will not hybridize (conditions of 90% stringency).
SEQUENCE LISTING

(1) GENERAL INFORMATION

(i) APPLICANT: SHERWOOD ET AL.

(ii) TITLE OF INVENTION: CHICKEN NEUROPEPTIDE GENE USEFUL
FOR IMPROVED POULTRY PRODUCTION

(iii) NUMBER OF SEQUENCES: 20

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Klarquist Sparkman Campbell Leigh &
Whinston, LLP

(B) STREET: One World Trade Center
121 S.W. Salmon Street
Suite 1600

(C) CITY: Portland

(D) STATE: Oregon

(E) COUNTRY: United States of America

(F) ZIP: 97204-2988

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Disk, 3-1/2 inch

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: MS DOS

(D) SOFTWARE: WordPerfect 7.0 & ASCII

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 08/789,329

(B) FILING DATE: 01/23/97

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Earp, David J.

(B) REGISTRATION NUMBER: 41,401

(C) REFERENCE/DOCKET NUMBER: 2847-46468/DJE

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (503) 226-7391

(B) TELEFAX: (503) 228-9446


(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6529 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

tttttcccac agtaacaatt ctggttgaaa taataaaaag gattattttt tggatatgtt 60 
aaatactgaa attttgattt ttggactctg gtgtaatttt ttttcctggg ggttccttgc 120 
taccaagtgt aagtatatta ttgacttttg aatccgatgg gcttttagaa aaaggagtta 180
atttatatat ttggggtgtt tctctgagat atttcactcc atgaaaacag atttcttcta 240 
agcctcagca agacttgaga tcaccctaaa atgtatgcat tgttcttgtt «ccataagc 300 
atccttgagt tagccttctc tgatgtttaa atgtagggag gtgaccagaa tttgctctga 360 
5 gacacaatga caaggaaggt atgagcacag aattacagaa agggaaaaaa tacatattct 420 
actcagataa aataaagctg ttgtcaataa catgat^at tcaaactcct catctatggg 480 
aagtaagtaa ctctgttctg aatatatcta ctgtctataa ctcgacagat: cagttctgca 540 
gttcgtgttc tggaccaggg attgcgtggg gtgcactggg cacaaccaca tctttgattt 600
ctttcttcaa attctcagtg tagaatgttt ttatattacc tcaaaagctt gagatacaag 660 
10 gtaaatacag gggtataggt ttaaagttct gtgtaagtgt ggggaaaata tctccacatc 720 
acagagtttg gagaaaaggc aatctgcatt tgctgatgtg cacatacaaa tttctatggg 780 
ttcttcatgc tacattaaaa gcttcacaag gaatctttcc ccaacttcca gcg tt ga«a 840 
gtgaacagcg ctattagtca ttacactcaa tatacttggt gtccttcctt gattttcctg 900 
aagaagcagc agtagggaga gctgaatcac gagttttcct catatttttc ttaatgaaaa 960 
15 atcatttgca ctgctgttct ggaacacagc accaggtaat gtcaacacat tagaaagatg 1020 
acaaaagttc ccaatcacgg gtctcgagtc agctcccttt agtgaggtta attgagcttc 1080 
caattcgaat atagtgagtc gtattacgcg cgctcactgg ccgtcgtttt acaacgtcgt 1140 
gactgggaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 1200 
gctggcgtat agcgaagagg cccgcaccga tcgcccaacc caacagttgc gcagcctgaa 1260 
20 tggcgaatgg gacgcgccct gtagcgctca ttaagcgcgg cgggaagctc taaatcgggg 1320 
ggctccttta ggttccgatt agtgctttac ggcacctcga ccccaaaaaa acttgattag 1380 
ggtgatggtt cacgtagtgg cctacgccct gatagacggt ttttttcgcc ctttgacgtt 1440 
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 1500 
ctcggtctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 1560 
25 tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgc ttacaattta 1620 
ggtggcactt ttcggggaaa tgtgcgcggg gcccctattg tttatttttc taatacattc 1680 
aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcagtaat attgcataat 1740 
ttacaatggc gaagaaaact atacatcccc ccccagacaa tctagggtgt ctctctacct 1800 
ccaagagatc gatgtcaaga gatcgcctag aacatcgtat tatctctttc ccggagaggc 1860 
30 aagtatacaa gaagaattaa ctcaatgtga agaatacatc ttagagggat tatacgatga 1920 
ctcctaccct ctgatctggg cttcttccta tcgcgcactc tgagagaagc aaccgagaat 1980 
tcagcccttt ataagagtgt gattttagcc aaaatgaatt ccatcccacc catcccaaaa 2040 
gtcggatact gtgggttcag taccgtgcat cgatcgatcg atggagcatg catcgatgct 2100 
agcatgcatg ctagctagct attctttatt tttttttttg tatgctagat aaagctttcg 2160 
35 gtttggattt tacgacgtct tgcatacgca ctgaacttga cattgacatc tctttgcaca 2220 
cctaaactaa aaataattaa gcccattact catcctcaga gaaccaaggt aatcaaggag 2280 
tctttgcaac tgagcaaaag ctgtccttca gaaggaatct ttccccaact tccagcgttg 2340 
attagtgaac agcgctatta gtcattacac tcaatatact tggtgtcctt ccttgatttt 2400 
cctgggggaa gaagcaggag taggggagaa ctgaatcacg agttttcctc atatttttct 2460 
40 taatgaaaaa gctcattaat ggatgctgtt ctggaacaca ggaccaggta atgtcaaaac 2520 
attaaaaaga tgacaaaagt tcccaaggac ggatctcgag tcgactcaca accagaggtt 2580 
gccaggacat tgcaggactg aactttctct ttcaaccctc caatctattt tgagctctcc 2640 
agaggaaatg ggatttgtcc acagtaacca ctgtggctga gataataaac tgcattactc 2700 
tctggatatg tttaaaatac tgagacttta atatgatttt tgcatctcag tgtaatttct 2760 
45 tttccgatgc tagcatcgat gctagctggg cgggttatca tgcccaacgt cgtagctgtg 2820 
ataaaaaaaa atgacgtctg tgctgtagct gatcgatgca tgcatgaata aaaaaagtgt 2880
gtgtgtgact ccgtgctgat gctgtgcttg gggctttctt gctaccaagt gtaagtgcta 2940 
tgtgagttgc agcttcgcat ttgcagactc ctatgggcaa tttttagaaa aaggagttaa 3000 
tttaatataa atttggggtg tttctctgaa gatatttcac tccacagtga aaacagattt 3060 
cttctaagcc tcagGCGAAT ATTGACAGCC CCCCTTTTTT TTCCTTTATT TGTCGAGTCG 3120
ATTCCCTAAC CACCCAACAA CTCTCTGCGC TTCTGCGCCT TCTTCATCCT TGCCCAGCGG 3180
AAAAGCCGGG AGCCCTTTGA CTCTTTCGGC CGCAACTTGG GGAGATAGCT CTATTTTTCC 3240
CCCCTCCTCT CTGGGGTTTT TCTCCTTTTT CCTCTCTCCC TTTCCCTTCC GCAGCCACAC 3300
GCTCTCAGTG CCGGGTGTCA CAgtgtgtaa atcaagactt gaggatcacc ctaaggtgta 3360
tgccttgttc ttgtttcagt agtacagagt gaatgaaaaa ccactggata agcatgttga 3420 
gttagcttct ctgatttggg tgtaggagtg acaagaattt gctctgagac acagGTTTCA 3480 
TGAGTGGCAA TGTGTATAAA ACGCTCTTAA CCCTCCTGGT CTATGGATTA ATAATGCATT 3540
GCAACGTCTA CTGCTCACCC GACCGTTGGA CTCCAGTACC CGGCGCTAAG gtgagtctgt 3600 
cagtgcaata tgctactctc acatcaggct ctgtgtcaca agtcatctgc caatctatca 3660 
gtgctgttaa gtggaattac tgagtaggtg cttggcccac caaggctgag aatccagctg 3720
cagtggatca gcccatctac cccctgcaca cacgtgtgga ttcaccccat cccctgccaa 3780
ccctgccacc ccatgctgcc ccacacagtc cctataggga tgaggctctc ccaccagggg 3840 
actgtgctgc caccatccga cactcctcta gcaaccaagc aaccaaccaa attggtgttt 3900 
tgtaataggc tgcatgctgt tttgtaatag gaaatataga tgattctact ttattctccc 3960 
tggctcctgc aacgataaga atgttctaat tcattattaa tttgttcctt ctatttctgg 4020 
taattctaaa ctatggataa cctcagtgat gccattagca atacctcgag gactacgaga 4080 
tggggtttac gtttctggac atggggtcgt ggaggcagcc cctctggcca taaggtggtt 4140 
ccctaaaccc tgacacattt gggacataac tggtacttgc cattcacact gggttacagt 4200 
gttgtgacaa acaacctgga gcacagagaa atcgtggcta gttatttgtc tggtgtaaat 4260 
gattgtagct cagcagcact ccgtgaaact tgcttattcc atctgtttat ggatttactc 4320 
tccactgagc acaaatggaa atgagggtgg gagcagggcc tagggtcttc tgccagcacc 4380 
aggggctgac tcctcccatg ggcctgccaa ggcctgcagt gacattctgc aagctagctg 4440 
gtgttagtgt agaggagggg ccatggaaca ttacatctcc aaaaggtact ttttcgatct 4500 
tggattgcac tttcataatt tatttctcta ttcaaatgat tgattgaaga acagttgtta 4560 
aaaaatctag ggaaatgata tttttattct tctgtaaaat acaaacttat ctgggtgaaa 4620 
aatgggagat tacaagtata tctgttggtc tctggaggaa attagctgca acccgtgtgg 4680
catgatggca cctccttgtg cagccaagcc ttctgttagg ctttccgttc atctagaaag 4740 
tgctctcctt gcctttgtgt tttgatagga atgactcctc ctttgctact cttatttcca 4800 
ctgtatgggg ttaagaagac tcgtcacgct gggctgagca ctggagcgag ctcgctccgt 4860
cccgcacggt cccgcggcgg ggacggggct ggggacgtcc ggctgagccc gcccgtgctt 4920
accgcagCTG GAGGAGGAGG TATACGACGA GGACGGGAAT ACCCTACAGG ACTTCGCACT 4980
ACGAGCAGGA GCCCCTGGGG GTGGCGGGCC GCGCCCGCGC TGGGGCAGGT GTACGGCGCT 5040
GTACTACCCG CCGGGAAAGA Ggtgacagag gggcgccgga tagggccggg gggggagggg 5100
gggaatggga aacctaaggg cccccggggg aggccgggaa atatcgtaat tccgccccac 5160 
ctgggctgcg cgagcggggg aggggggtgg ggagggaggg cgcctcgggg atgggcgctg 5220 
acgggccgtg ccccggcagG CACGCCGATG GGATCTTCAG CAAAGCCTAC AGGAAACTCC 5280
TGGGCCAGCT GTCCGCAAGA AATTACCTGC ACTCCCTGAT GGCCAAGCGG GTCGGgtaag 5340
ggctgcggcg ggacgggagc gaacaaagcg cggcgcgcgg cggccggggc ggggcggccc 5400 
attctccccg cggtgctctg ccggaacgag agaggcggcc gcacccgggg ctcggcgtcc 5460
ctcccgcggg gcagccccgg gtggtgccat cggagcgaac cccccccggg aacgcgatgc 5520 
ataatgcatg gggggggggg ggggagacgt ctcgctccgg cccggccccg ccctttgtct 5580
gccgggagat gcggggccgg ggcgggggtt agggccgggg ttggggttgg ggttgggtta 5640 
gggccgggtt gggtcgggcc cgggagggcc cctcctgatg gttgtgtcct tctcggtgct 5700 
ttgcagCGGT GCCAGCAGCG GCCTGGGGGA CGAGGCGGAA CCGCTCAGCA AGCGCCACAT 5760
AGACGGCATC TTCACGGACA GCTACAGCCG CTACCGGAAA CAAATGGCTG TCAAGAAATA 5820
CTTAGCGGCC GTCCTGGGGA AAAGGTATAA ACAAAGAGTT AAAAACAAAG GACGCCGAGT 5880
AGCGTATTTG TAGgatgagc aaccgccgct gccgtgcgta gtcctgagag agagagagag 5940
agagagagag agattgagag agagagagag agagagagag acccaaccac cccaacccaa 6000 
acaaaagtca tttccaaagt gacggaacga ccgccgctcc cgtgttcccc aaacatgtat 6060 
ttatgtataa gtaagccatt aaatgaataa tattttgata ataatatggt tttcttttgt 6120 
acgaaagcac agatctactt tgtggaccaa tccttgagtt atatatgaga tagaatatat 6180 
atatataata ctgctactaa agagcgattc ttcataccaa gctgcaccag gacgagagtt 6240 
cgcctgagct gttagttttt atagaaaaca aatagacgaa aaaaaaaaaa aagacaatca 6300 
ccgcttccaa cagcgctcct atttttgtaa cggaaacgaa aagggcactg tttttattgc 6360 
cacgggggcg aacacctcag ttctcaccgt gtgcgctgtg atagggaggg ctcacgcagc 6420 
aggggtcccc ccggcctcga tctctctctc tatttccccc acccccccct tttttttttt 6480 
tccctttgat tccggtccta tccgtatcag tcctcctcag agcgatgag 6529 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1088 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

gcgaatattg acagcccccc ttttttttcc tttatttgtc gagtcgattc cctaaccacc 60
caacaactct ctgcgcttct gcgccttctt catccttgcc cagcggaaaa gccgggagcc 120
ctttgactct ttcggccgca acttggggag atagctctat ttttcccccc tcctctctgg 180
ggtttttctc ctttttcctc tctccctttc ccttccgcag ccacacgctc tcagtgccgg 240
gtgtcacagt ttcATGAGTG GCAATGTGTA TAAAACGCTC TTAACCCTCC TGGTCTATGG 300
ATTAATAATG CATTGCAACG TCTACTGCTC ACCCGACCGT TGGACTCCAG TACCCGGCGC 360
TAAGCTGGAG GAGGAGGTAT ACGACGAGGA CGGGAATACC CTACAGGACT TCGCACTACG 420
AGCAGGAGCC CCTGGGGGTG GCGGGCCGCG CCCGCGCTGG GGCAGGTGTA CGGCGCTGTA 480
CTACCCGCCG GGAAAGAGGC ACGCCGATGG GATCTTCAGC AAAGCCTACA GGAAACTCCT 540
GGGCCAGCTG TCCGCAAGAA ATTACCTGCA CTCCCTGATG GCCAAGCGGG TCGGCGGTGC 600
CAGCAGCGGC CTGGGGGACG AGGCGGAACC GCTCAGCAAG CGCCACATAG ACGGCATCTT 660
CACGGACAGC TACAGCCGCT ACCGGAAACA AATGGCTGTC AAGAAATACT TAGCGGCCGT 720
CCTGGGGAAA AGGTATAAAC AAAGAGTTAA AAACAAAGGA CGCCGAGTAG CGTATTTGTA 780
Ggatgagcaa ccgccgctgc cgtgcgtagt cctgagagag agagagagag agagagagag 840
attgagagag agagagagag agagagagac ccaaccaccc caacccaaac aaaagtcatt 900
tccaaagtga cggaacgacc gccgctcccg tgttccccaa acatgtattt atgtataagt 960
aagccattaa atgaataata ttttgataat aatatggttt tcttttgtac gaaagcacag 1020
atctactttg tggaccaatc cttgagttat atatgagata gaatatatat atataatact 1080
gctactaa 1088

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 175 aa

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Ser Gly Asn Val Tyr Lys Thr Leu Leu Thr Leu Leu Val Tyr Gly
1 5 10 15

Leu Ile Met His Cys Asn Val Tyr Cys Ser Pro Asp Arg Trp Thr Pro
20 25 30

Val Pro Gly Ala Lys Leu Glu Glu Glu Val Tyr Asp Glu Asp Gly Asn
35 40 45

Thr Leu Gln Asp Phe Ala Leu Arg Ala Gly Ala Pro Gly Gly Gly Gly
50 55 60

Pro Arg Pro Arg Trp Gly Arg Cys Thr Ala Leu Tyr Tyr Pro Pro Gly
65 70 75 80

Lys Arg His Ala Asp Gly Ile Phe Ser Lys Ala Tyr Arg Lys Leu Leu
85 90 95

Gly Gln Leu Ser Ala Arg Asn Tyr Leu His Ser Leu Met Ala Lys Arg
100 105 110

Val Gly Gly Ala Ser Ser Gly Leu Gly Asp Glu Ala Glu Pro Leu Ser
115 120 125

Lys Arg His Ile Asp Gly Ile Phe Thr Asp Ser Tyr Ser Arg Tyr Arg
130 135 140

Lys Gln Met Ala Val Lys Lys Tyr Leu Ala Ala Val Leu Gly Lys Arg
145 150 155 160

Tyr Lys Gln Arg Val Lys Asn Lys Gly Arg Arg Val Ala Tyr Leu
165 170 175



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 aa

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

His Ala Asp Gly Ile Phe Ser Lys Ala Tyr Arg Lys Leu Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Asn Tyr Leu His Ser Leu Met Ala Lys Arg Val Gly
20 25 30

Gly Ala Ser Ser Gly Leu Gly Asp Glu Ala Glu Pro Leu Ser
35 40 45

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 aa 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

His Ile Asp Gly Ile Phe Thr Asp Ser Tyr Ser Arg Tyr Arg Lys Gln
1 5 10 15

Met Ala Val Lys Lys Tyr Leu Ala Ala Val Leu Gly Lys Arg Tyr Lys
20 25 30

Gln Arg Val Lys Asn Lys
35


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1079 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

gcgaatattg acagcccccc ttttttttcc tttatttgtc gagtcgattc cctaaccacc 60
caacaactct ctgcgcttct gcgccttctt catccttgcc cagcggaaaa gccgggagcc 120
ctttgactct ttcggccgca acttggggag atagctctat ttttcccccc tcctctctgg 180
ggtttttctc ctttttcctc tctccctttc ccttccgcag ccacacgctc tcagtgccgg 240
gtgtcacagt ttcATGAGTG GCAATGTGTA TAAAACGCTC TTAACCCTCC TGGTCTATGG 300
ATTAATAATG CATTGCAACG TCTACTGCTC ACCCGACCGT TGGACTCCAG TACCCGGCGC 360
TAAGCTGGAG GAGGAGGTAT ACGACGAGGA CGGGAATACC CTACAGGACT TCGCACTACG 420
AGCAGGAGCC CCTGGGGGTG GCGGGCCGCG CCCGCGCTGG GGCAGGTGTA CGGCGCTGTA 480
CTACCCGCCG GGAAAGAGGC ACGCCGATGG GATCTTCAGC AAAGCCTACA GGAAACTCCT 540
GGGCCAGCTG TCCGCAAGAA ATTACCTGCA CTCCCTGATG GCCAAGCGGG TCGGCAGCGG 600
CCTGGGGGAC GAGGCGGAAC CGCTCAGCAA GCGCCACATA GACGGCATCT TCACGGACAG 660
CTACAGCCGC TACCGGAAAC AAATGGCTGT CAAGAAATAC TTAGCGGCCG TCCTGGGGAA 720
AAGGTATAAA CAAAGAGTTA AAAACAAAGG ACGCCGAGTA GCGTATTTGT AGgatgagca 780
accgccgctg ccgtgcgtag tcctgagaga gagagagaga gagagagaga gattgagaga 840
gagagagaga gagagagaga cccaaccacc ccaacccaaa caaaagtcat ttccaaagtg 900
acggaacgac cgccgctccc gtgttcccca aacatgtatt tatgtataag taagccatta 960
aatgaataat attttgataa taatatggtt ttcttttgta cgaaagcaca gatctacttt 1020
gtggaccaat ccttgagtta tatatgagat agaatatata tatataatac tgctactaa 1079

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 aa

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Ser Gly Asn Val Tyr Lys Thr Leu Leu Thr Leu Leu Val Tyr Gly
1 5 10 15

Leu Ile Met His Cys Asn Val Tyr Cys Ser Pro Asp Arg Trp Thr Pro
20 25 30

Val Pro Gly Ala Lys Leu Glu Glu Glu Val Tyr Asp Glu Asp Gly Asn
35 40 45

Thr Leu Gln Asp Phe Ala Leu Arg Ala Gly Ala Pro Gly Gly Gly Gly
50 55 60

Pro Arg Pro Arg Trp Gly Arg Cys Thr Ala Leu Tyr Tyr Pro Pro Gly
65 70 75 80

Lys Arg His Ala Asp Gly Ile Phe Ser Lys Ala Tyr Arg Lys Leu Leu
85 90 95

Gly Gln Leu Ser Ala Arg Asn Tyr Leu His Ser Leu Met Ala Lys Arg
100 105 110

Val Gly Ser Gly Leu Gly Asp Glu Ala Glu Pro Leu Ser Lys Arg His
115 120 125

Ile Asp Gly Ile Phe Thr Asp Ser Tyr Ser Arg Tyr Arg Lys Gln Met
130 135 140

Ala Val Lys Lys Tyr Leu Ala Ala Val Leu Gly Lys Arg Tyr Lys Gln
145 150 155 160

Arg Val Lys Asn Lys Gly Arg Arg Val Ala Tyr Leu
165 170

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 aa

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

His Ala Asp Gly Ile Phe Ser Lys Ala Tyr Arg Lys Leu Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Asn Tyr Leu His Ser Leu Met Ala Lys Arg Val Gly
20 25 30

Ser Gly Leu Gly Asp Glu Ala Glu Pro Leu Ser
35 40



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 992 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

gcgaatattg acagcccccc ttttttttcc tttatttgtc gagtcgattc cctaaccacc 60
caacaactct ctgcgcttct gcgccttctt catccttgcc cagcggaaaa gccgggagcc 120
ctttgactct ttcggccgca acttggggag atagctctat ttttcccccc tcctctctgg 180
ggtttttctc ctttttcctc tctccctttc ccttccgcag ccacacgctc tcagtgccgg 240
gtgtcacagt ttcATGAGTG GCAATGTGTA TAAAACGCTC TTAACCCTCC TGGTCTATGG 300
ATTAATAATG CATTGCAACG TCTACTGCTC ACCCGACCGT TGGACTCCAG TACCCGGCGC 360
TAAGCTGGAG GAGGAGGTAT ACGACGAGGA CGGGAATACC CTACAGGACT TCGCACTACG 420
AGCAGGAGCC CCTGGGGGTG GCGGGCCGCG CCCGCGCTGG GGCAGGTGTA CGGCGCTGTA 480
CTACCCGCCG GGAAAGAGCG GTGCCAGCAG CGGCCTGGGG GACGAGGCGG AACCGCTCAG 540
CAAGCGCCAC ATAGACGGCA TCTTCACGGA CAGCTACAGC CGCTACCGGA AACAAATGGC 600
TGTCAAGAAA TACTTAGCGG CCGTCCTGGG GAAAAGGTAT AAACAAAGAG TTAAAAACAA 660
AGGACGCCGA GTAGCGTATT TGTAGgatga gcaaccgccg ctgccgtgcg tagtcctgag 720
agagagagag agagagagag agagattgag agagagagag agagagagag agacccaacc 780
accccaaccc aaacaaaagt catttccaaa gtgacggaac gaccgccgct cccgtgttcc 840
ccaaacatgt atttatgtat aagtaagcca ttaaatgaat aatattttga taataatatg 900
gttttctttt gtacgaaagc acagatctac tttgtggacc aatccttgag ttatatatga 960
gatagaatat atatatataa tactgctact aa 992

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 aa

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Ser Gly Asn Val Tyr Lys Thr Leu Leu Thr Leu Leu Val Tyr Gly
1 5 10 15

Leu Ile Met His Cys Asn Val Tyr Cys Ser Pro Asp Arg Trp Thr Pro
20 25 30

Val Pro Gly Ala Lys Leu Glu Glu Glu Val Tyr Asp Glu Asp Gly Asn
35 40 45

Thr Leu Gln Asp Phe Ala Leu Arg Ala Gly Ala Pro Gly Gly Gly Gly
50 55 60

Pro Arg Pro Arg Trp Gly Arg Cys Thr Ala Leu Tyr Tyr Pro Pro Gly
65 70 75 80

Lys Ser Gly Ala Ser Ser Gly Leu Gly Asp Glu Ala Glu Pro Leu Ser
85 90 95

Lys Arg His Ile Asp Gly Ile Phe Thr Asp Ser Tyr Ser Arg Tyr Arg
100 105 110

Lys Gln Met Ala Val Lys Lys Tyr Leu Ala Ala Val Leu Gly Lys Arg
115 120 125

Tyr Lys Gln Arg Val Lys Asn Lys Gly Arg Arg Val Ala Tyr Leu
130 135 140



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 129 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

CAC GCC GAT GGG ATC TTC AGC AAA GCC TAC AGG AAA CTC CTG GGC CAG 48
CTG TCC GCA AGA AAT TAC CTG CAC TCC CTG ATG GCC AAG CGG GTC GGC 96
AGC GGC CTG GGG GAC GAG GCG GAA CCG CTC AGC 129



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

CAC GCC GAT GGG ATC TTC AGC AAA GCC TAC AGG AAA CTC CTG GGC CAG 48
CTG TCC GCA AGA AAT TAC CTG CAC TCC CTG ATG GCC AAG CGG GTC GGC 96
GGT GCC AGC AGC GGC CTG GGG GAC GAG GCG GAA CCG CTC AGC 138



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 114 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

CAC ATA GAC GGC ATC TTC ACG GAC AGC TAC AGC CGC TAC CGG AAA CAA 48
ATG GCT GTC AAG AAA TAC TTA GCG GCC GTC CTG GGG AAA AGG TAT AAA 96
CAA AGA GTT AAA AAC AAA 114
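
As a consistency check on the listing, the 114 bp sequence of SEQ ID NO: 13 can be translated codon by codon and compared with the 38 amino acid PACAP peptide of SEQ ID NO: 5. The Python sketch below is a minimal illustration that assumes the standard genetic code; the names CODON_TABLE, translate and SEQ_ID_13 are hypothetical and introduced here only for the example.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA coding sequence (reading frame 1) to one-letter codes."""
    dna = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[dna[i:i + 3]]
                   for i in range(0, len(dna) - len(dna) % 3, 3))


# SEQ ID NO: 13 as given in the listing (spaces separate codons).
SEQ_ID_13 = (
    "CAC ATA GAC GGC ATC TTC ACG GAC AGC TAC AGC CGC TAC CGG AAA CAA "
    "ATG GCT GTC AAG AAA TAC TTA GCG GCC GTC CTG GGG AAA AGG TAT AAA "
    "CAA AGA GTT AAA AAC AAA"
)

peptide = translate(SEQ_ID_13)
print(peptide)       # HIDGIFTDSYSRYRKQMAVKKYLAAVLGKRYKQRVKNK
print(len(peptide))  # 38
```

The one-letter output corresponds residue for residue to the three-letter sequence His Ile Asp Gly Ile Phe ... Val Lys Asn Lys given for SEQ ID NO: 5.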



(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CATGTTTGGA CAGAACACAA GTGAGCG 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CATTCGGATG GGATCTTCAC GGATAG 



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GAGCCCCGCC CGTGCTTACC GCAG 24

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
CACGCCGATG GGATCTTCAG CAAA

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 bp

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CCCGACCCGC TTGGCCATCA GGGA 24

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 aa

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

His Ile Asp Gly Ile Phe Thr Asp Ser Tyr Ser Arg Tyr Arg Lys Gln
1 5 10 15

Met Ala Val Lys Lys Tyr Leu Ala Ala Val Leu
20 25

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 aa

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

His Ala Asp Gly Ile Phe Ser Lys Ala Tyr Arg Lys Leu Leu Gly Gln
1 5 10 15

Leu Ser Ala Arg Asn Tyr Leu His Ser Leu Met Ala Lys
20 25
Claims:

1. An isolated nucleic acid molecule including at least 25 contiguous nucleotides of
the sequence shown in Seq. I.D. No. 1.

2. An isolated nucleic acid molecule according to claim 1 wherein the molecule
includes at least 30 contiguous nucleotides of the sequence shown in Seq. I.D. No. 1.

3. An isolated nucleic acid molecule according to claim 1 wherein the molecule
includes at least 50 contiguous nucleotides of the sequence shown in Seq. I.D. No. 1.

4. An isolated nucleic acid molecule according to claim 1 wherein the nucleic acid
molecule encodes a polypeptide capable of stimulating the release of pituitary growth hormone from
chicken pituitary cells.

5. A nucleotide vector including a nucleic acid molecule according to claim 4.

6. A transgenic host cell produced by introducing into a host cell a nucleotide vector
according to claim 5.

7. A method of producing a polypeptide capable of stimulating the release of pituitary
growth hormone from chicken pituitary cells, the method comprising the steps of:

(a) providing a transgenic host cell according to claim 6;

(b) cultivating the cell under conditions supporting the production of the polypeptide; and

(c) harvesting the polypeptide.

8. A purified polypeptide produced according to the method of claim 7.

9. A composition for administration to poultry comprising a purified polypeptide
according to claim 8.

10. An isolated nucleic acid molecule encoding a chicken neuropeptide precursor
polypeptide, the polypeptide having an amino acid sequence selected from the group consisting of the
sequences shown in:

(a) Seq. I.D. No. 3;

(b) Seq. I.D. No. 7; or

(c) Seq. I.D. No. 9.

11. A nucleotide vector including a nucleic acid molecule according to claim 10.

12. A transgenic host cell produced by introducing into the host cell a nucleotide vector
according to claim 11.

13. A method of producing a polypeptide capable of stimulating the release of pituitary
growth hormone from chicken pituitary cells, the method comprising the steps of:

(a) providing a transgenic host cell according to claim 12;

(b) cultivating the cell under conditions supporting the production of the
polypeptide; and

(c) harvesting the polypeptide.

14. A purified polypeptide produced according to the method of claim 13.






15. A purified polypeptide having an amino acid sequence selected from the group
consisting of the sequences shown in:

(a) Seq. I.D. No. 3;

(b) Seq. I.D. No. 7; or

(c) Seq. I.D. No. 9.

16. A purified chicken neuropeptide precursor polypeptide, wherein the polypeptide
includes at least 20 consecutive amino acids selected from the amino acid sequences shown in:

(a) Seq. I.D. No. 3;

(b) Seq. I.D. No. 7; or

(c) Seq. I.D. No. 9.

and wherein the polypeptide includes both a chicken GRF peptide and a chicken PACAP peptide.

17. An isolated nucleic acid molecule having a nucleotide sequence comprising at least
10 consecutive nucleotides of the sequence shown in Seq. I.D. No. 12, wherein the nucleic acid
molecule encodes a polypeptide capable of stimulating the release of pituitary growth hormone from
chicken pituitary cells.

18. An isolated nucleic acid molecule according to claim 17 wherein the nucleic acid
molecule encodes a chicken GRF peptide having an amino acid sequence selected from the group
consisting of the amino acid sequences shown in:

(a) Seq. I.D. No. 4; or

(b) Seq. I.D. No. 8.

19. A purified polypeptide encoded by a nucleic acid molecule according to claim 17.

20. A purified chicken GRF polypeptide having an amino acid sequence selected from
the group consisting of the amino acid sequences shown in:

(a) Seq. I.D. No. 4; and

(b) Seq. I.D. No. 8.

21. An isolated DNA molecule wherein the molecule includes at least 10 consecutive
nucleotides of the sequence shown in Seq. I.D. No. 13 and wherein the DNA molecule encodes a
polypeptide capable of stimulating the release of pituitary growth hormone from chicken pituitary
cells.

22. A DNA molecule according to claim 21 wherein the DNA molecule encodes a
polypeptide having an amino acid sequence as shown in Seq. I.D. No. 13.

23. A DNA molecule according to claim 22 wherein the DNA molecule has a nucleotide
sequence as shown in Seq. I.D. No. 5.

24. A recombinant DNA molecule including a nucleic acid molecule according to
claim 4.

25. A transgenic non-human animal wherein the genome of the animal includes a
recombinant DNA molecule according to claim 24.
26. A composition including a purified polypeptide encoded by a nucleic acid molecule
according to claim 4, wherein the polypeptide is capable of stimulating the release of pituitary
growth hormone from chicken pituitary cells.

27. An isolated nucleic acid molecule having a nucleotide sequence selected from the
group consisting of the sequences shown in:

(a) Seq. I.D. No. 1 or its complementary strand;

(b) Seq. I.D. No. 4 or its complementary strand;

(c) Seq. I.D. No. 12 or its complementary strand; and

(d) sequences which hybridize under conditions of at least 75% stringency to the
sequences defined in (a)-(c);

wherein the nucleic acid molecule encodes a peptide capable of stimulating the release of
pituitary growth hormone from chicken pituitary cells.

28. An isolated nucleic acid molecule according to claim 27(d) wherein the nucleic acid
molecule hybridizes under conditions of at least 90% stringency.



[Drawing sheets 1/4 to 4/4 of WO 98/32857 (PCT/CA98/00033), substitute sheets (Rule 26): figure content is not recoverable from the extracted text.]



INTERNATIONAL SEARCH REPORT 



International Application No

PCT/CA 98/00033 



A. CLASSIFICATION OF SUBJECT MATTER

IPC 6 C12N15/16 C07K14/575 A61K38/22 A01K67/027 C07K14/60

According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practical, search terms used)



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



EMBL databank 

Accession number A61070 

31-12-93 

Yasuhara T.

XP002068607

see the whole document 

WO 96 09064 A (UNIV TULANE ; ARIMURA AKIRA 

(US)) 28 March 1996 

see figures 1,6; example 2 

WO 94 26897 A (UNIV VICTORIA INNOVAT DEV)
24 November 1994 

see abstract; claims 41-45; figures 1-16 

EP 0 404 034 A (TAKEDA CHEMICAL INDUSTRIES 
LTD) 27 December 1990 
see claims 1-29 

-/-- 



1-9,16, 
21-24,26 



17,19,25 

1-9,16, 
21,24,26 



17,19,25 



1-3 



[X] Further documents are listed in the continuation of box C.

[X] Patent family members are listed in annex.



Special categories of cited documents:

"A" document defining the general state of the art which is not
considered to be of particular relevance

"E" earlier document but published on or after the international
filing date

"L" document which may throw doubts on priority claim(s) or
which is cited to establish the publication date of another
citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or
other means

"P" document published prior to the international filing date but
later than the priority date claimed

"T" later document published after the international filing date
or priority date and not in conflict with the application but
cited to understand the principle or theory underlying the
invention

"X" document of particular relevance; the claimed invention
cannot be considered novel or cannot be considered to
involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention
cannot be considered to involve an inventive step when the
document is combined with one or more other such documents,
such combination being obvious to a person skilled
in the art.

"&" document member of the same patent family



Date of the actual completion of the international search 

18 June 1998 



Date of mailing of the international search report 

03/07/1998 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Authorized officer 



Gurdjian, D 



Form PCT/ISA/210 (second sheet) (July 1992) 



page 1 of 2 






INTERNATIONAL SEARCH REPORT 

C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT

International Application No

PCT/CA 98/00033

Category

Citation of document, with indication, where appropriate, of the relevant passages

Relevant to claim No.

P,X



MCRORY, JOHN E. ET AL: "Expression and
alternative processing of a chicken gene
encoding both growth hormone-releasing
hormone and pituitary adenylate
cyclase-activating polypeptide"

K EL lic!r8;iUr!6^ik 9 rpJ 0 o 2 2 oe 8 eo 8 

see the whole document 



1-24, 
26-28 



2 



Form PCT/ISA/210 (continuation of second sheet) (July 1992)



page 2 of 2 



INTERNATIONAL SEARCH REPORT 

Information on patent family members

International Application No

PCT/CA 98/00033 



Patent document           Publication    Patent family      Publication
cited in search report    date           member(s)          date

WO 9609064 A              28-03-1996     WO 9609318 A       28-03-1996
                                         AU 3681095 A       09-04-1996

WO 9426897 A              24-11-1994     US 5695954 A       09-12-1997
                                         AU D/91594 A       12-12-1994
                                         EP 0699238 A       06-03-1996
                                         MA 954559 A        12-01-1996

EP 0404034 A              27-12-1990     JP 3022987 A       31-01-1991
                                         AT 127844 T        15-09-1995
                                         CA 2019363 A       20-12-1990
                                         DE 69022297 D      19-10-1995
                                         DE 69022297 T      22-02-1996
                                         US 5198542 A       30-03-1993
                                         US 5326860 A       05-07-1994



Form PCT/ISA/210 (patent family annex) (July 1992)






